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ABSTRACT 


Spectral  estimation  for  multiple  2-D  signals  by  model-based  methods  is  developed. 
The  procedures  compute  the  entire  spectral  matrix  of  autospectra  and  cross  spectra  for 
the  set  of  2-D  signals.  Spectral  analysis  by  autoregressive  (AR)  modeling  is  studied 
extensively.  Specific  differences  between  AR  models  for  this  problem  and  those  for 
lower  dimensional  problems  are  highlighted.  An  extension  of  the  Jackson-Chien  method 
for  combining  estimates  with  single  quadrant  support  is  proposed  and  a  method  is 
developed  for  estimating  the  model  parameters  direcdy  from  the  data  (i.e.  without  prior 
estimation  of  a  correlation  matrix).  A  measure  of  the  similarity  of  two  spectral  estimates 
based  on  the  statistical  divergence  is  proposed  and  used  to  compare  various  spectral  esti¬ 
mates.  A  comprehensive  set  of  experimental  studies  are  presented  showing  the  perfor¬ 
mance  of  the  methods  in  estimating  the  autospectra  and  magnitude  and  phase  of  the  cross 
spectra.  The  Maximum  Likelihood  Method  (MLM)  of  spectral  estimation  is  extended  to 
the  multichannel  2-D  case.  The  properties  are  compared  experimentally  with  the  autore¬ 
gressive  methods.  The  Improved  Maximum  Likelihood  Method  (IMLM)  is  also 
developed  for  the  multichannel  case.  Finally  applications  of  multichannel  2-D  spectral 
analysis  models  to  image  coding  are  presented. 
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L  INTRODUCTION 

Model-based  methods  of  spectrum  estimation  have  received  considerable  attention 
over  the  last  fifteen  or  twenty  years  because  of  their  ability  to  provide  high  resolution 
with  limited  data  and  potentially  more  accurate  estimates.  A  lot  of  work  has  been  done 
for  model-based  methods  of  spectrum  estimation  of  one  dimensional  (1-D)  random 
processes  with  the  development  of  methods  such  as  maximum  entropy,  autoregressive 
(AR),  and  maximum  likelihood.  A  review  of  the  model-based  methods  of  spectral 
analysis  in  the  1-D  case  can  be  found  in  the  references  (Childers,  1978;  Kesler,1986;  Kay 
and  Marple.1981;  Nuttal,1976;  Proc.  of  RADC  Spectrum  Estimation  Workshop,  1978; 
Robinson,  1982;  Dudgeon  and  Mersereau,1977;  Marple,1987).  Unfortunately  much  of 
the  work  done  in  1-D  spectral  estimation  does  not  extend  easily  to  the  two-dimensional 
(2-D)  case.  This  is  true  in  particular  for  the  maximum  entropy  and  AR  spectral  estimates 
which  are  identical  in  the  1-D  case  but  may  be  significantly  different  in  the  2-D  case. 
Indeed,  while  a  2-D  AR  spectral  estimate  can  be  computed  from  any  estimated  correla¬ 
tion  function,  the  corresponding  maximum  entropy  spectral  estimate  may  not  even  exist. 
McClellan  (1982)  gives  a  review  of  methods  of  multidimensional  power  spectrum  esti¬ 
mation.  Lang  and  McClellan  (1982,  1983)  focus  attention  on  2-D  maximum  entropy 
power  spectrum  estimation  and  show  the  specific  differences  between  the  2-D  and  the 

1- D  case. 

This  thesis  addresses  the  problem  of  spectrum  estimation  for  multiple  2-D  signals. 
We  refer  to  the  set  of  signals  as  a  multichannel  2-D  signal.  Multichannel  2-D  signals 
can  be  viewed  as  several  planes  of  correlated  2-D  data  (see  Fig.  1.1).  For  instance,  a 
digital  color  image  may  be  considered  as  a  sample  function  of  a  discrete  three-channel 

2- D  random  signal  consisting  of  three  registered  components  representing  red,  green,  and 
blue  intensities.  Other  examples  of  multichannel  2-D  signals  can  be  found  in  array  pro¬ 
cessing,  in  certain  radar  applications,  and  in  the  set  of  images  received  from  a  satellite 
multispectral  scanner. 

The  spectrum  estimation  methods  that  will  be  applied  to  these  signals  compute  the 
entire  spectral  matrix.  That  is,  estimates  for  the  2-D  autospectra  and  magnitude  and 
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phase  of  the  cross  spectra  are  produced  all  at  once.  This  thesis  describes  some  methods 
of  spectrum  estimation  that  are  model-based  and  can  provide  better  resolution  than  con¬ 
ventional  Fourier  transform  methods.  It  will  be  seen  that  the  methods  we  describe  have 
characteristics  of  both  1-D  multichannel  problems  and  2-D  single  channel  problems  and 
so  are  considerably  different  from  3-D  spectrum  estimation  procedures.  The  latter  are 
almost  identical  to  2-D  single  channel  problems. 

In  this  chapter  we  define  the  problem  of  spectral  analysis  for  multiple  2-D  random 
signals  and  oudine  our  approach  to  this  problem.  We  give  a  brief  description  of  linear 
prediction  as  it  relates  to  spectral  estimation,  and  an  outline  of  some  previous  work  in 
spectral  estimation  for  single  2-D  signals.  We  also  indicate  how  the  models  we 
develop  can  be  applied  to  other  applications  such  as  coding. 

The  main  contributions  of  the  thesis  are  as  follows:  First,  the  multichannel  2-D  AR 
model  and  specific  differences  from  single  channel  models  are  described.  Models  with 
quadrant  plane  and  nonsymmetric  half  plane  support  and  their  forward  and  backward 
forms  are  defined.  It  is  shown  that  unlike  lower-dimensional  cases  the  forward  and  back¬ 
ward  models  produce  different  spectral  estimates.  Secondly,  an  estimate  for  the  spectral 
matrix  based  on  a  combination  of  models  with  first  and  second  quadrant  support  is 
presented  and  a  method  for  estimating  the  model  parameters  directly  from  the  data  (i.e. 
without  prior  estimation  of  a  correlation  function)  is  given.  Third,  a  measure  for  compar¬ 
ing  spectral  estimates  based  on  the  statistical  divergence  is  proposed  and  used  to  compare 
various  spectral  estimates.  Fourth,  a  comprehensive  set  of  experimental  studies  of  the 
spectral  estimation  procedures  is  presented.  Fifth,  the  Maximum  Likelihood  Method  and 
Improved  Maximum  Likelihood  Method  of  spectral  estimation  are  extended  to  the  mul¬ 
tichannel  2-D  case  and  results  are  compared  experimentally  to  power  spectral  estimation 
results  based  on  AR  modeling. 

Finally,  although  the  main  goal  of  this  thesis  was  multichannel  2-D  spectral  analysis 
a  large  part  of  the  work  had  to  deal  with  modeling  of  multichannel  2-D  random 
processes.  These  models  have  applications  to  signal  processing  problems  other  than 
spectral  analysis.  To  illustrate  this  versatility  we  have  applied  the  models  and  devoted 
one  chapter  of  the  thesis  to  the  problem  of  image  coding.  The  results  show  that  efficient 
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coding  of  color  images  can  be  obtained  using  general  models  that  have  been  derived 
from  analysis  of  several  different  types  of  color  images. 

A.  SIGNAL  MODELING  AND  LINEAR  PREDICTION 

A  large  class  of  methods  for  spectral  analysis  deals  with  modeling  the  signal  as  a 
linear  filter  driven  by  a  known  noise  source  (usually  white  noise).  One  class  of  modeling 
for  which  a  considerable  amount  of  work  has  been  done  is  the  AR  or  "all-pole"  model. 

In  this  case  the  random  signal  is  considered  as  the  output  of  a  2-D  purely  recursive  and 
stable  linear  filter  which  is  driven  by  white  noise.  Sometimes  this  model  is  called  the 
synthesis  model  since  it  may  be  thought  of  as  a  random  signal  generator.  The  inverse  of 
the  filter  used  in  the  AR  model,  which  is  also  causal,  is  a  2-D  FIR  filter  that  whitens  the 
random  process.  This  filter  is  thus  known  as  the  whitening  filter.  ) 

Another  class  of  models  is  defined  by  applying  linear  prediction  techniques.  The 
random  signal  sample  is  predicted  from  a  linear  combination  of  the  previous  N  samples 
weighted  by  a  set  of  filter  coefficients.  The  difference  between  the  actual  data  sample 
and  the  predicted  one  is  known  as  the  prediction  error  and  is  minimized  in  the  mean 
square  sense.  This  linear  predictive  model  is  also  called  a  minimum  mean  square  error 
(MMSE)  model.  (Jain,1981) 

The  distinction  between  the  AR  model  and  linear  predictive  model  lies  in  the  fact 
that  linear  prediction  does  not  always  produce  white  noise.  The  AR  and  linear  predictive 
models  are  identical  only  when  the  process  is  truly  described  by  an  AR  model  with  the 
support  region  used  for  linear  prediction.  In  this  case  a  white  noise  prediction  error  is 
obtained.  Fortunately  a  close  AR  representation  for  most  2-D  random  processes  can  be 
obtained  by  choosing  a  linear  predictive  model  with  sufficiently  large  nonsymmetric  half 
plane  support.  Theoretical  results  (Ekstrom  and  Woods,  1976;  Marzetta,  1978)  show  that 
NSHP  support  is  sufficient  to  represent  any  2-D  random  process  but  in  general  the  sup¬ 
port  will  be  infinite. 

The  idea  of  linear  prediction  is  therefore  to  determine  a  causal  filter  that  (approxi¬ 
mately)  whitens  a  given  random  process.  There  are  two  classes  of  linear  prediction  prob¬ 
lems.  In  one  case  we  are  given  a  finite  set  of  samples  from  the  random  process,  and 
we  want  to  estimate  the  causal  linear  predictive  whitening  filter.  We  refer  to  this 
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problem  as  the  AR  model  fitting  problem.  In  this  case  the  whitening  filter  is  modeled  as 
an  FIR  filter. 

In  the  second  class  of  linear  prediction  problems  we  are  given  the  actual  power  den¬ 
sity  spectrum  of  the  process,  and  the  idea  is  to  derive  from  it  the  causal  whitening  filter. 
This  problem  is  called  spectral  factorization  (Marzetta,  1978).  We  concentrate  in  this 
thesis  on  problems  in  the  first  class. 

The  two  most  popular  methods  of  AR  model  fitting  are  the  autocorrelation  method, 
and  the  Burg  algorithm.  In  the  first  method  estimation  of  the  model  parameters  is  based 
on  prior  estimation  of  the  correlation  function.  In  the  second  method  the  model  parame¬ 
ters  are  estimated  directly  from  the  data. 

In  1-D  an  important  representation  of  the  linear  predictive  filter  is  the  lattice  struc¬ 
ture.  In  this  case  the  system  is  represented  by  so-called  reflection  coefficients.  Various 
attempts  have  been  made  to  extend  the  lattice  to  2-D.  One  approach  was  suggested  by 
Marzetta  (1979).  He  defined  a  set  of  2-D  reflection  coefficients  and  a  corresponding  2-D 
lattice  implementation  of  the  linear  predictive  filter.  The  polynomial  representation  of 
the  2-D  minimum  phase  filter  corresponding  to  the  given  finite  set  of  reflection 
coefficients  will  have  a  finite  region  of  support.  Unfortunately  the  converse  of  this  state¬ 
ment  is  not  true.  If  the  polynomial  representation  is  defined  on  a  finite  region  of  support, 
the  corresponding  lattice  structure  representation  will  be  in  general  finite  in  one  direction 
and  infinite  in  the  other  direction.  In  either  case  the  stability  of  the  corresponding  IIR 
filter  is  guaranteed  if  the  magnitude  of  the  reflection  coefficients  is  less  than  one. 

In  another  approach  Parker  and  Kayran  (1984)  suggested  a  different  extension  of  1- 
D  lattice  AR  modeling  to  the  2-D  case.  Although  the  resulting  2-D  filter  has  some 
interesting  characteristics,  it  is  neither  a  whitening  filter  nor  a  minimum  mean  square 
error  prediction  filter. 

B.  SPECTRAL  ESTIMATION  METHODS 

Power  spectrum  estimation  has  progressed  through  several  stages  since  the  turn  of 
the  century.  The  first  estimation  methods  used  extensively  were  based  on  the  Fourier 
transform.  They  are  known  as  classical  or  conventional  methods  (Kay  and  Marple,1981; 
Oppenheim  and  Schafer,  1975;  Priestley,  1981)  and  can  be  classified  into  correlelogram 
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and  periodogram  methods.  The  correlelogram  method  developed  by  Blackman  and 
Tukey  involves  estimating  the  autocorrelation  function  from  the  observed  data,  window¬ 
ing  the  autocorrelation  estimate  in  an  appropriate  manner,  and  then  Fourier  transforming 
the  windowed  autocorrelation  function  to  obtain  the  estimated  power  spectrum.  In  the 
periodogram  method  the  spectral  estimate  can  be  obtained  by  taking  the  Fourier 
transform  of  the  observed  data  and  squaring  its  magnitude.  Modified  periodogram 
methods  were  also  developed  that  use  window  functions.  A  main  disadvantage  of  these 
methods  is  the  relatively  poor  resolution  when  the  data  length  is  short 

More  recendy,  model-based  methods  have  become  available  to  estimate  spectra  with 
increased  resolution.  Examples  of  model-based  methods  are  the  maximum  likelihood 
me  >od  (MLM),  autoregressive  or  AR  method,  maximum  entropy  method  (MEM),  and 
the  Pisarenko  method.  (Burg,  1975;  Cadzow  and  Ogino,1981;  Capon,  1969;  Child- 
ers,1978;  Dowla  and  Lim,1983, 1984,1985;  Dudgeon  and  Mersereau,1984;  Efron  and 
Tufts,1986;  Ekstrom,1984;  Jackson  and  Chien,1979;  Jain,1981;  Kay  and  Marple,1981; 
Kesler,1986;  Lang  and  McCleIIan,1980,1982,1983;  Lim  and  Malik,198 1,1982;  Mar- 
pie, 1987;  Marzetta,1978;  McClellan,  1982;  Nuttal,1976;  Papoulis,1981;  Parker  and  Kay- 
ran,1984;  Robinson,  1982;  Strand,  1977;  Woods,  197 6) 

Among  the  various  model-based  methods  cited,  MEM  is  particularly  interesting.  In 
1-D  the  method  is  theoretically  equivalent  to  the  AR  method  for  Gaussian  random 
processes.  In  conventional  methods  the  power  spectrum  estimate  obtained  from  a  given 
autocorrelation  sequence  assumes  no  values  of  the  sequence  outside  the  given  range; 
actually  the  values  outside  this  range  are  considered  to  be  zero.  The  effect  of  this  trunca¬ 
tion  is  overcome  by  using  a  window  function.  MEM  retains  all  the  estimated  values  of 
the  correlation  sequence  without  modification,  and  at  the  same  time  extends  the  values  of 
the  autocorrelation  function  outside  the  given  range.  The  spectral  estimate  in  this  case  is 
the  most  random  or  has  the  maximum  entropy  of  any  power  spectrum  whose  correlation 
function  matches  that  of  the  given  data.  (Burg, 1967,1968, 1975) 

In  1-D  linear  prediction  the  minimum  phase  property  of  the  prediction  error  filter 
(PEF)  insures  the  stability  of  its  inverse.  This  guarantees,  among  other  things,  that  a 
stable  AR  model  exists  for  the  data.  In  addition,  an  AR  model  of  order  N  matches  the 
N+l  given  values  of  the  correlation  function  and  extends  it  in  the  maximum  entropy 
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sense.  The  computation  of  the  prediction  error  filter  involves  solution  of  a  Toeplitz  sys¬ 
tem  of  Normal  equations.  The  equations  can  be  solved  by  the  Levinson  algorithm  which 
leads  to  a  prediction  error  variance  and  a  set  of  reflection  coefficients  to  represent  the 
filter.  In  1-D  therefore,  there  is  a  one-to-one  correspondence  between  the  given  values  of 
the  correlation  function,  the  parameters  of  the  linear  predictive  filter  that  appear  in  the 
Normal  equations,  and  the  set  of  reflection  coefficients  and  prediction  error  variance. 

In  the  2-D  case  the  linear  prediction  problem  is  more  complicated  than  in  the  1-D 
case.  Further  in  2-D  the  autocorrelation  matching  property  does  not  always  hold  and  a 
positive  definite  extension  may  not  exist  (Ekstrom  and  Woods,  1976;  Lang  and  McClel- 
lan,1980, 1982,1983;  Marzetta,1978).  In  contrast  to  the  1-D  case  the  2-D  AR  filter  is  not 
always  stable  and  the  ME  spectral  estimate  and  the  AR  spectral  estimate  are  not  neces¬ 
sarily  the  same.  Finally  there  is  no  assurance  that  a  2-D  ME  spectral  estimate  exists 
(Dickinson,  1980;  Jain, 1981;  McClellan,  1982).  Woods  (1976)  proved  that  the  maximum 
entropy  spectral  estimate  exists  and  is  unique  provided  that  the  known  autocorrelation 
coefficients  are  actually  part  of  a  valid  autocorrelation  function,  but  Dickinson  (1980) 
pointed  out  that  2-D  spectrum  estimates  based  on  a  finite  segment  of  the  autocorrelation 
function  or  on  estimated  autocorrelation  data  do  not  always  exist. 

Spectral  factorization  in  1-D  is  obtained  by  dividing  the  spectral  density  function 
(SDF)  of  the  random  process  into  two  factors.  One  factor  contains  all  its  poles  and  zeros 
inside  the  unit  circle  and  is  called  the  causal  or  minimum  phase  factor.  The  other  factor 
contains  all  its  poles  and  zeros  outside  the  unit  circle  and  is  called  the  anticausal  or  max¬ 
imum  phase  factor.  The  causal  factor  can  be  written  in  the  form  of  a  difference  equation 
such  that  the  random  process  generated  by  driving  this  difference  equation  with  white 
noise  has  the  given  SDF.  The  AR  model  is  obtained  when  the  difference  equation  is 
purely  recursive  i.e.  the  system  function  has  only  poles.  Also  the  difference  equation  can 
be  interpreted  as  a  linear  predictor,  where  a  sample  of  the  random  process  is  predicted 
from  the  linear  combination  of  previous  samples,  such  that  the  mean  square  of  a  predic¬ 
tion  error  is  minimized. 

In  2-D  the  spectral  factorization  problem  is  more  complicated.  Marzetta  (1978) 
tried  to  approach  the  spectral  factorization  problem  by  extending  the  results  of  1-D 
prediction  theory  to  2-D.  He  proved  that  the  exact  solution  can  be  obtained  by  solving  an 


infinite  set  of  linear  equations.  Using  a  finite  set  of  linear  equations  gives  an  approximate 
solution.  Ekstrom  and  Woods  (1976)  and  subsequendy  others  looked  at  the  2-D  spectral 
factorization  problem  direcdy.  Their  results  show  that  while  2-D  spectral  factorization  is 
indeed  possible,  the  resulting  white  noise-driven  filter  required  to  model  the  process  may 
have  infinite  NSHP  support  in  general. 

MLM  was  originally  developed  by  Capon  (1969)  for  seismic  array  frequency 
wavenumber  analysis.  The  MLM  spectral  estimate  may  be  derived  by  solving  a  classical 
optimal  filtering  problem.  The  filter  is  designed  to  pass  the  power  in  a  narrow  band  about 
the  signal  frequency  of  interest  and  minimize  the  power  due  to  interfering  spectral  com¬ 
ponents  such  as  noise.  From  a  computational  view  point,  MLM  is  straightforward.  It  has 
a  closed  form  solution  and  requires  only  the  inversion  of  a  non-singular  covariance 
matrix.  However  a  major  disadvantage  is  that  it  has  poorer  resolution  than  other  methods 
(such  as  AR  method  and  MEM).  Burg  (1972)  showed  that  there  exists  a  simple,  exact 
relationship  between  maximum  entropy  spectra  and  maximum  likelihood  spectra  when 
the  correlation  function  is  known  at  uniform  intervals  of  lag  for  the  1-D  case.  He  proved 
that  the  reciprocal  of  the  maximum  likelihood  spectrum  is  equal  to  the  average  of  the 
reciprocals  of  the  maximum  entropy  spectra  obtained  from  predictors  of  order  one  to  P, 
where  P  represents  the  order  of  the  MLM  filter.  Lim  and  Dowla  (1984)  showed  a  similar 
relationship  between  MLM  and  AR  spectral  estimates  for  m-D  signals  sampled  nonuni- 
formly  or  uniformly.  They  proposed  a  new  method  for  2-D  spectral  estimation  called 
improved  MLM  (IMLM)  (1985).  This  method  is  based  on  the  relationship  between  the 
MLM  and  AR  model.  The  IMLM  has  a  computational  requirement  similar  to  that  of  the 
MLM,  but  has  a  resolution  property  which  is  considerably  better  than  that  of  MLM. 
Lagunas  et  al  (1985),  and  Baggeroer  (1975)  reported  separately  on  how  to  include  gen¬ 
eral  concepts  of  the  1-D  MLM  procedure  in  a  two-channel  problem  of  cross  spectrum 
estimation.  Their  algorithms  treat  cross  spectral  estimation  separately  from  autospectral 
estimation  and  do  not  consider  the  full  multichannel  spectral  estimation  problem.  Up  to 
this  point  no  counterpart  to  IMLM  appears  to  have  been  developed  for  cross  spectral 
estimation. 


While  spectral  estimation  for  single  channel  2-D  signals  leads  to  a  number  of 
interesting  and  surprising  results,  spectral  estimation  for  multichannel  2-D  signals  leads 
to  still  more  interesting  phenomena.  The  spectrum  estimation  methods  that  will  be 
applied  to  these  signals  in  this  thesis  compute  the  entire  spectral  matrix.  That  is,  esti¬ 
mates  for  the  2-D  autospectra  and  magnitude  and  phase  of  the  cross  spectra  are  produced 
all  at  once.  As  mentioned  earlier,  the  methods  have  characteristics  of  both  1-D  mul¬ 
tichannel  methods  and  2-D  single  channel  methods.  However  properties  sometimes  exist 
that  are  found  in  neither  the  1-D  multichannel  or  the  2-D  single  channel  case.  For  exam¬ 
ple,  it  will  be  shown  that  unlike  either  of  the  other  two  cases,  the  forward  and  backward 
forms  of  the  2-D  AR  models  produce  theoretically  different  spectral  estimates. 

C.  APPLICATION  OF  SPECTRAL  ANALYSIS  MODELS  TO 

IMAGE  CODING 

Data  compression  is  the  conversion  of  a  train  of  data  into  a  train  of  low  rate  data  for 
purposes  of  storage  or  communication  over  a  digital  communication  channel.  Theory 
and  application  of  data  compression  for  images  and  speech  has  become  very  important 
due  to  increased  demand  for  availability  of  these  data  in  data  bases  and  in  integrated  net¬ 
work  environments.  Images  and  speech  are  probably  the  currently  most  important  appli¬ 
cations  of  data  compression.  (Gibson,1980;  Goodman  and  Gersho,1974;  Jain,1981;  Jay- 
ant,1974,1976,1984;  Margos.Schafer  and  Mersereau,1984;  Netravali,1980) 

Due  to  the  conversion  of  high  rate  data  to  low  rate  data  there  is  a  loss  of  fidelity  or 
an  increase  in  the  distortion.  The  aim  of  data  compression  is  to  obtain  the  best  possible 
fidelity  for  the  given  data  rate  or  to  minimize  the  rate  required  for  a  given  fidelity. 

Waveform  coding  techniques  have  become  popular  because  of  their  simplicity  and 
generally  good  performance.  Waveform  coding  can  be  classified  into  the  following 
major  categories:  pulse  code  modulation  (PCM),  predictive  coding,  transform  coding, 
interpolative,  and  extrapolative  coding.  Each  of  these  classes  can  be  further  divided  into 
fixed  and  adaptive  methods. 

In  PCM  the  signal  sampling  generally  occurs  at  the  Nyquist  rate,  and  the  sampling 
amplitude  is  represented  by  some  number  of  quantization  levels.  In  linear  predictive  or 
differential  PCM  (DPCM)  the  sample  to  be  encoded  is  predicted  from  the  encoded  values 
of  the  previously  transmitted  samples  and  only  the  prediction  error  is  quantized  for 
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transmission.  Such  an  approach  can  be  made  adaptive  either  by  changing  the  prediction 
or  quantization  or  by  not  transmitting  the  prediction  error  whenever  it  is  below  a  certain 
threshold.  In  transform  coding,  an  alternative  representation  of  the  signal  is  made  first  by 
taking  linear  combinations  of  samples  (called  coefficients)  in  a  block  of  data  and  then 
quantizing  the  coefficients.  Transform  coders  can  be  made  adaptive  by  changing  the  type 
of  transformation  of  the  coefficients.  Interpolative  and  extrapolative  coders  attempt  to 
send  certain  samples  to  the  receiver  and  either  interpolate  or  extrapolate  all  the  rest. 
Adaptation  can  be  incorporated  by  changing  the  criterion  for  selection  of  the  samples  to 
be  sent  and/or  the  procedure  for  interpolating  or  extrapolating  the  remaining  samples. 
Predictive  and  transform  coding  can  also  be  combined  resulting  in  hybrid  coding.  All  of 
these  techniques  share  a  fundamental  property:  the  actual  quantization  or  coding  is  done 
on  scalars.  (Jayant,1974, 1976,1984;  Zetterberg,  Ericsson  and  Brusewitz,1982) 

Recently  vector  quantization  (VQ)  has  evolved  as  a  new  and  powerful  speech  and 
image  coding  technique.  The  data  to  be  encoded  is  divided  into  small  blocks  and  then 
encoded  block  by  block.  The  idea  is  to  identify  a  list  of  reconstruction  levels  of  possible 
blocks  of  data  which  represent  the  encoded  information.  We  refer  to  the  reconstruction 
levels  in  VQ  as  the  codebook.  The  algorithm  most  often  used  for  generating  this  code¬ 
book  is  named  as  the  Linde-Buzo-Gray  (LBG)  algorithm  and  seeks  to  develop  centers  or 
clusters  of  the  data  in  a  multidimensional  metric  space  which  can  serve  as  reconstruction 
levels.  (Cuperman  and  Gersho,1985;  Goldberg,  Boucher  and  Shilien,1986;  Gray  and 
Linde,1982;  Gray,1984;  Hang  and  Woods, 1985;  Linde,  Buzo  and  Gray, 1980;  Makhoul, 
Roucos  and  Gish,  1985) 

In  our  work  we  concentrate  on  linear  predictive  coding  for  color  images.  The  coder 
is  based  on  the  multichannel  linear  predictive  models  developed  in  the  thesis  and  the 
error  residual  is  coded  at  two  or  three  levels  for  each  channel.  Methods  involving 
specific  linear  predictive  models  for  each  image  to  be  transmitted  are  compared  to 
methods  involving  use  of  an  "average"  (non-optimal)  set  of  coefficients  for  the  prediction 
error  filter. 


D.  AN  OUTLINE  OF  THE  DISSERTATION 

The  purpose  of  this  dissertation  is  to  introduce  and  analyze  some  new  algorithms  for 
multichannel  2-D  linear  prediction  and  power  spectrum  estimation  (the  linear  predictive 
models  used  in  the  spectrum  estimation  are  also  applied  to  the  coding  of  color  images). 
The  thesis  is  organized  as  follows: 

In  Chapter  II  we  briefly  review  some  of  the  existing  results  on  single  channel  2-D 
spectral  estimation.  The  2-D  AR  model  and  its  specific  differences  from  1-D  models  are 
described.  Two  specific  2-D  AR  models  are  considered  here  namely  the  nonsymmetric 
half  plane  (NSHP)  model  and  quadrant  plane  (QP)  model.  Spectral  estimation  experi¬ 
mental  results  are  compared  with  results  of  the  Fourier  transform  techniques.  We  also 
review  in  this  chapter  the  lattice  structure  and  a  proposed  2-D  Burg  algorithm. 

In  addition,  some  original  results  are  presented  in  Chapter  II  that  are  later  extended 
to  the  2-D  multichannel  case.  We  develop  a  method  of  parameter  estimation  that  allows 
the  model  parameters  to  be  estimated  directly  from  the  data  without  prior  estimation  of  a 
correlation  function.  This  method  is  based  on  the  close  relation  between  the  single  chan¬ 
nel  2-D  linear  prediction  problem  and  the  multichannel  1-D  linear  prediction  problem. 
We  refer  to  the  method  as  a  direct  method.  Secondly  we  propose  a  quantitative  measure 
for  comparing  the  similarity  of  two  single  channel  2-D  spectral  estimates  based  on  the 
statistical  divergence  and  use  it  to  compare  the  spectral  estimates  generated  by  various 
2-D  models. 

Chapter  HI  deals  with  the  representation  and  statistical  characterization  of  mul¬ 
tichannel  2-D  signals.  We  discuss  multichannel  2-D  linear  prediction  theory  using  both 
NSHP  and  QP  models.  The  multichannel  2-D  AR  model  and  specific  differences  from 
single  channel  models  are  described.  The  concept  of  forward  and  backward  forms  of 
linear  prediction  and  their  related  AR  models  are  introduced.  It  is  shown  that  unlike  the 
single  channel  2-D  case,  the  forward  and  backward  models  are  not  identical.  We  also 
define  the  four  possible  QP  models  and  discuss  their  differences.  Relations  between  2-D 
multichannel  problems  and  higher  order  1-D  multichannel  problems  are  then  discussed. 
These  results  are  used  to  formulate  a  direct  method  for  estimation  of  the  2-D  multichan¬ 
nel  model  parameters  (i.e.  without  explicit  prior  estimation  of  the  correlation  function). 


The  divergence  measure  used  for  comparing  spectral  estimates  of  single  channel  2-D  ran¬ 
dom  processes  in  Chapter  II  is  then  developed  for  the  multichannel  case. 

Chapter  IV  is  concerned  with  multichannel  2-D  spectral  estimation  based  on  AR 
models.  Here  the  problem  is  to  obtain  accurate  spectral  estimates  given  a  finite  number 
of  data  samples.  We  begin  by  summarizing  the  spectral  estimation  models,  and  show 
that  since  the  NSHP  forward  and  backward  models  have  distinctly  different  model 
parameters,  in  general  this  leads  to  different  spectral  estimates.  Similarly  it  is  shown  that 
in  general  none  of  the  four  possible  multichannel  2-D  QP  models  give  the  same  estimate 
for  the  spectral  matrix.  This  suggests  a  generalization  of  the  procedure  used  by  Jackson 
and  Chien  for  combining  multiple  quadrant  based  spectral  estimates  into  a  single  com¬ 
bined  spectral  estimate.  We  call  this  procedure  the  combined  quadrant  (CQ) 
method,  and  it  can  be  used  effectively  with  the  direct  method  of  parameter  estima¬ 
tion  developed  in  Chapter  III. 

A  comprehensive  set  of  experimental  studies  for  estimating  the  spectral  matrix  for 
2-channel  2-D  random  processes  is  presented.  The  estimate  includes  the  2-D  autospec¬ 
trum  for  each  channel  and  magnitude  and  phase  of  the  cross  spectra.  The  estimates  com¬ 
pared  correspond  to  NSHP,  CQ,  and  direct  CQ  models. 

In  Chapter  V  we  review  the  2-D  single  channel  Maximum  Likelihood  Method 
(MLM)  of  spectral  estimation  and  describe  extensions  of  the  MLM  method  to  cross  spec¬ 
trum  estimation.  We  then  develop  the  MLM  method  specifically  for  the  2-D  multichan¬ 
nel  case,  and  compare  our  results  to  the  earlier  results  where  the  components  of  the  spec¬ 
tral  matrix  are  computed  individually.  We  discuss  the  differences  in  the  methods  and 
show  experimentally  that  our  algorithm  gives  results  very  close  to  those  for  the  single 
channel  MLM  algorithm  and  the  extended  cross  spectrum  MLM  estimation  technique. 

Since  MLM  estimates  generally  have  significantly  poorer  resolution  than  those  based 
on  AR  models,  we  consider  the  so-called  Improved  MLM  (IMLM)  of  Lim  and  Dowla 
and  develop  it  for  the  multichannel  case.  This  method  retains  the  computational  simpli¬ 
city  of  the  MLM,  but  gives  better  resolution.  We  carry  out  an  experiment  to  compare  the 
amplitude  and  phase  of  cross  spectra  of  AR,  MLM  and  IMLM  techniques.  Finally,  in 
this  chapter  we  measure  and  compare  the  resolution  properties  of  the  estimate  experi¬ 
mentally  as  a  function  of  signal-to-noise  ratio  and  as  a  function  of  the  model  order. 


Chapter  VI  applies  the  2-D  spectral  analysis  models  developed  earlier  to  the  problem 
of  image  coding.  We  are  concerned  here  with  predictive  coding  i.e.  linear  prediction 
followed  by  quantization  of  the  prediction  error.  We  begin  by  summarizing  the  two 
types  of  quantization  namely  scalar  quantization  and  vector  quantization. 

In  our  experimental  procedure  in  this  chapter  we  compare  two  methods  initially,  in 
the  first,  the  whole  frame  of  the  image  is  divided  into  subframes;  the  predictor 
coefficients  are  then  computed  separately  for  each  subframe.  In  the  second,  the  predictor 
coefficients  are  obtained  for  the  whole  frame  of  the  image.  Both  of  these  methods  have 
the  disadvantage  that  the  linear  prediction  coefficients  matrices  must  be  computed  in  real 
time,  and  transmitted  to  the  receiver  as  side  information.  This  greatly  increases  complex¬ 
ity  of  the  coding  system.  As  an  alternative  we  consider  using  a  fixed  set  of  prediction 
matrices,  i.e.  one  that  does  not  depend  on  the  specific  image  being  coded.  In  this  way 
both  receiver  and  transmitter  have  the  linear  prediction  matrices  and  no  side  information 
has  to  be  transmitted.  Such  prediction  matrices  can  be  generated  by  various  averaging 
methods  discussed  in  the  chapter.  We  compare  the  results  of  this  coding  to  that  resulting 
from  the  previous  two  methods. 

The  last  chapter,  VII,  concludes  the  thesis.  It  briefly  summarizes  our  most  important 
results  and  cites  areas  for  further  research. 


IL  SINGLE  CHANNEL  2-D  SPECTRAL  ESTIMATION 


A.  INTRODUCTION 

Two-dimensional  (2-D)  digital  signal  processing  is  concerned  with  the  processing  of 
discrete  signals  which  can  be  represented  on  2-D  lattice.  For  instance,  a  digital  image  is 
a  2-D  array  of  real  numbers  representing  intensity  as  a  function  of  spatial  position.  Geo¬ 
logical  data  and  other  sampled  waveforms  received  from  an  array  of  sensors  can  also  be 
considered  as  2-D  or  higher  dimensional  discrete  signals.  Since  2-D  signals  can  be 
deterministic  or  stochastic  the  mathematical  characterization  of  2-D  signals  can  be  deter¬ 
ministic  or  statistical.  In  the  deterministic  representation,  each  value  of  the  2-D  sequence 
is  uniquely  determined  by  a  mathematical  expression.  In  statistical  representation  a  sig¬ 
nal  is  specified  by  the  average  properties  such  as  the  mean,  correlation,  and  power  spec¬ 
trum.  Many  2-D  signals  can  be  considered  as  a  sample  function  of  a  2-D  discrete,  wide 
sense  stationary  (or  homogeneous)  random  field,  which  is  described  in  terms  of  its  mean 
and  covariance  function. 

An  important  topic  in  stationary  2-D  random  process  analysis  is  power  spectrum 
estimation.  The  problem  of  spectral  estimation  is  considerably  more  complex  in  the  2-D 
case  than  in  the  1-D  case  (Cadzow  and  Ogino,1981;  Dudgeon,1984;  Ekstrom,1984; 
Ekstrom  and  Woods,1976;  Jain.1977,1978,1981;  Lang  and  McClellan,  1980, 1982, 1983; 
Lim  and  Malik,1981;  Marzetta,1978, 1979,1980;  McClellan,  1982;  Nikias  and 
Raghuveer,1983;  Woods,  1976).  The  spectral  estimation  problem  may  be  stated  briefly  in 
the  following  way.  Given  samples  of  a  stationary  and  homogeneous  random  field 
x(nx,ri2),  specified  by  its  second  order  statistics,  estimate  its  power  spectrum. 

There  are  many  different  techniques  that  can  be  used  to  estimate  the  2-D  power 
spectrum  (McClellan,  1982).  Conventional  methods  based  on  the  discrete  Fourier 
transform  (DFT)  generalize  from  1-D  time  series  analysis  in  a  straightforward  manner 
when  the  sampling  is  uniform.  However  the  more  recent  high  resolution  model-based 
methods  such  as  MEM  (Lang  and  McClellan,  1982;  Lim  and  Malik,  198 1,1 982;  Roucos 
and  Childers,1979,1980;  Wemecke  and  D’Addario,1977),  and  AR-modeling  (Dudgeon 


and  Mersereau,1984;  Ekstrom,1984;  Jackson  and  Chien,1979;  Lacoume,1983;  Sharma 
and  Chellappa,1986),  take  on  a  different  and  more  complicated  form. 

In  this  chapter  we  briefly  review  some  of  the  existing  results  on  single  channel  2-D 
spectral  estimation.  The  2-D  AR  model  and  its  specific  differences  from  1  -D  models  are 
described.  Two  specific  2-D  AR  models  are  considered  here  namely  the  nonsymmetric 
half  plane  (NSHP)  model  and  quadrant  plane  (QP)  model.  Some  AR  model-based  spec¬ 
tral  estimation  experimental  results  are  given  and  compared  with  results  of  classical 
Fourier  transform  techniques.  We  also  review  in  this  chapter  the  lattice  structure  for  the 
2-D  AR  model  developed  by  Marzetta  and  a  proposed  2-D  Burg  algorithm. 

Some  original  results  are  also  presented  in  this  chapter  that  are  later  extended  to  the 
2-D  multichannel  case.  We  develop  a  method  of  parameter  estimation  that  allows  the 
model  parameters  to  be  estimated  directly  from  the  data  without  prior  estimation  of  a 
correlation  function.  This  method  is  based  on  the  close  relation  between  the  single  chan¬ 
nel  2-D  linear  prediction  problem  and  the  multichannel  1-D  linear  prediction  problem. 
We  refer  to  the  method  as  a  direct  method.  In  addition  we  propose  a  quantitative  meas¬ 
ure  for  comparing  the  similarity  of  two  single  channel  2-D  spectral  estimates  based  on 
the  divergence  and  use  it  to  compare  the  spectral  estimates  generated  by  various  2-D 
models. 

B.  REVIEW  OF  CLASSICAL  METHODS 

Historically,  methods  based  on  the  Fourier  transform  have  been  widely  used  for 
spectral  estimation  (Jenkins  and  Watts,  1968;  Oppenheim  and  Schafer,1975;  Priest¬ 
ley,  1981;  Robinson,  1982).  The  advent  of  the  Fast  Fourier  Transform  (FFT)  algorithm 
helped  to  make  these  techniques  very  attractive.  Although  many  variations  exist,  there 
are  fundamentally  only  two  distinct  estimators  of  the  Fourier  type.  One  is  the  correlelo- 
gram  method  which  is  based  on  the  fact  that  the  power  spectrum  and  the  autocorrelation 
function  form  a  Fourier  transform  pair.  The  other  is  the  periodogram  estimate  and  its 
various  smoothed  versions.  The  use  of  these  estimators  in  the  multidimensional  case  is 
virtually  identical  to  the  use  in  the  1-D  case. 


1.  Correlelogram  Method  of  Spectral  Estimation 


We  define  the  power  spectrum  of  a  random  process  xin^n^  as  the  Fourier 
transform  of  the  autocorrelation  function  Rx  (ky,k£ 


Sx(co1,co2)=  £  £  Rx(kM.e  >a*2  (2.1) 

where: 

Rx  (*  iJc2)  =  E^x(nhn^x  ( n  y-k \,n2-k^  (2.2) 

and  E  [  .  ]  denotes  the  expectation  over  the  ensemble  of  the  random  process.  Hence, 

given  a  finite  number  of  data  samples  on  a  rectangular  lattice 


x(nl,n2),  the  estimate  for  the  autocorrelation  can  be  made 


only  on  a  finite  range  -yt  (k^k^,  -(P  lyP  2)<,(k  ^k^iP  lyP  ,  and  the  power  spectrum 
estimate  can  be  formed  as  : 

Pi  Pi  .  .  .  . 

S{ co1,co2)=  X  X  /?(*!,* 2)  e~7a>1V;C0^  (2.3) 

ky=-P  1  ki=-P2 

Since  with  most  methods  for  estimating  the  correlation  function  fewer  data  samples  enter 
into  estimation  of  covariances  at  larger  lags,  the  reliability  of  the  estimates  decreases 
with  increasing  lag.  This  necessitates  application  of  a  window  to  the  estimated  correla¬ 
tions  to  reduce  the  contribution  of  correlations  at  larger  lags  to  the  power  spectral  estima¬ 
tion.  A  typical  windowed  correlelogram  estimate  is  of  the  form 


Pi  Pi  ,  .  .  .  . 

S(C01,to2)=  X  X  C  (kltk2)  R  (k  i,k2)  (2.4) 

ki=~P  1  ki=-P  2 

where  C  ( k  i,k2)  is  the  window  function.  This  multiplication  of  the  estimated  autocorrela¬ 
tion  sequence  by  a  window  function  is  equal  to  the  convolution  of  the  true  power  spec¬ 
trum  with  the  transform  of  the  window  function.  In  the  1-D  case,  the  triangular  window 
or  other  windows  such  as,  the  raised  cosine  may  be  used.  Harris  (1978)  provides  a 


summary  of  various  windows  applied  for  spectral  analysis  computation.  In  the  2-D  case, 
windows  are  most  often  generated  either  by  taking  the  outer  product  of  two  1-D  windows 
to  generate  a  2-D  window  with  rectangular  region  of  support  or  by  sampling  a  circularly 
rotated  1-D  window  function  to  generate  a  2-D  window  with  circular  region  of  support 
(Dudgeon  and  Mersereau,1984). 

2.  Power  Spectrum  Estimation  Using  Periodoeram  Methods 


As  mentioned  earlier,  several  different  estimates  of  the  2-D  autocorrelation  are 
possible.  If  the  estimated  autocorrelation  function  R(klJC'2)  of  the  random  process 
x(nx,n2)  given  on  a  finite  range  0£n  X<N l ;  0<n2<N2  is  taken  as 


R(kx,k  2)  = 


Ni-\kl\-l  #2-1*21-1 


*1*2  ni=0  A 


X  X(nl,n^x(n1-kl,n2-k2) 


\kx  \<Nl-l ;  I >k2 1 <ZV 2— 1 


then  the  estimated  power  spectrum  of  the  random  process  can  be  given  by  the  Fourier 
transform  of  the  estimated  autocorrelation  function 


tfi-i  JVr-i  .  .  .  . 

S(  co1,co2)=  £  2  *(*  1,*2).<r'<a,V'<“ifc2  (2.6) 
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Since  the  Fourier  transform  of  the  finite  data  segment  x(nx,n 2),  (0</i  X<N  x ;  0 <n  2<N 2)  is 


„  N t-l  N 2-I 

X{eJ«» i.«>2))=  £  ^X(ni,n2 


it  is  easy  to  show  that 


«i=0  n2=0 


^1^2  n1=o  «2=o 


=  — IX(e>(<0,,<0l))  I2 
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This  is  called  the  periodogram.  Thus  in  the  periodogram  method  the  estimated  power 
spectrum  S  (oij.o^  can  be  obtained  by  taking  the  2-D  Fourier  transform  of  the  given  data 
set  x(n  !,n2)  and  squaring  its  magnitude.  We  can  generalize  the  periodogram  estimate  by 
slightly  to  form  the  modified  periodogram  S  (cd^cd^  by  multiplying  the  data  by  a  2-D 
window  function  C'(n ltn 2)- 


5(0)!  ,©2)  = 


Ni-INj-l 

!  j  j  C/(nl,n2)x(nl,n2)e 


*1*2  nT=On% 


The  window  C  appearing  in  Eqn.  (2.4)  is  then  the  2-D  convolution  of  the  window  C' 
with  itself. 

The  periodogram  is  known  to  be  a  poor  estimate  of  the  power  spectrum  of  the 
process.  Bartlett  suggested  a  modification  of  the  periodogram  to  obtain  a  statistically 
more  reliable  spectral  estimate.  In  this  case  the  signal  is  divided  into  L-blocks,  and  the 
periodogram  of  each  block  is  computed.  The  individual  periodograms  are  then  averaged 
to  obtain  the  spectral  estimate. 


1  L"1 

5(G)!,0)2)  =  —  2)  Si  ((!>!, CD2) 
L  1=0 


(2.10) 


Where  5/( co^co^  represents  the  periodogram  power  spectral  estimate  of  the  block  /.  In 
this  case  the  variance  is  reduced,  but  at  the  expense  of  a  reduction  in  the  frequency  reso¬ 
lution.  Welch  (1967)  proposed  dividing  the  data  record  into  either  overlapping  or  nono¬ 
verlapping  segments.  Each  data  segment  is  windowed  before  computing  its  periodo¬ 
gram.  The  resulting  periodograms  are  then  averaged  as  above  (see  Eqn.  (2.10)).  The 
resulting  estimate  is  asymptotically  unbiased  and  has  lower  variance  than  the  Bartlett 
method.  (Oppenheim  and  Schafer,  1975;  Priestley,1981) 

The  disadvantage  of  both  Fourier  methods,  is  the  resolution  limit.  Longer  data 
sets  provide  better  resolution  but  result  in  increased  variance.  For  this  reason  the  Fourier 
methods  have  met  with  limited  success  and  much  recent  research  has  been  directed  at 
deriving  high  resolution  estimators.  The  remainder  of  this  chapter  will  be  devoted  to 
some  of  these  other  techniques. 
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C.  AUTOREGRESSIVE  (AR)  SPECTRAL  ESTIMATION 

Power  spectral  estimation  methods  based  on  the  Fourier  transform  are  known  also  as 
"conventional"  or  classical  methods.  The  computational  efficiency  of  FFT  algorithms 
makes  these  methods  simple  and  easy  to  implement.  On  the  other  hand  the  low  resolu¬ 
tion  afforded  by  these  methods  is  due  in  part  to  the  fact  that,  the  data  is  given  on  a  finite 
range  and  is  assumed  to  be  zero  outside  this  range  Modem  spectrum  estimation  tech¬ 
niques  give  higher  resolution  than  the  conventional  methods.  These  methods  have  also 
been  called  data  adaptive  methods  or  high  resolution  spectral  estimation  techniques.  The 
various  modem  spectral  estimation  techniques  that  exist  in  the  literature  may  be  based  on 
different  principles,  but  all  strive  towards  a  common  goal:  to  improve  the  resolution  of 
the  power  spectral  estimate  beyond  the  Fourier  limit  by  explicitly  or  implicitly  providing 
an  extrapolation  of  the  data  outside  it’s  known  extent.  Since  most  of  the  modem  spectral 
estimation  techniques  are  model-based,  they  are  usually  parameteric  in  nature. 

Since  the  theory  for  1-D  parametric  spectral  estimation  techniques  is  well  under¬ 
stood,  a  natural  intermediate  step  in  the  development  of  2-D  spectral  estimators  was  the 
extension  of  1-D  spectral  estimators  to  the  2-D  problem.  A  method  of  spectral  estima¬ 
tion  which  has  attracted  much  attention  in  1-D  signal  analysis  is  the  technique  of  Max¬ 
imum  Entropy  (ME)  proposed  by  Burg  (1967,1968,1975).  This  method  was  shown  to  be 
equivalent  to  AR  modeling  (Childers,  1978;  Nuttall,1976)  in  1-D.  The  success  of  the  ME 
method  in  1-D  has  led  researchers  to  explore  this  problem  in  2-D.  Unfortunately,  the 
simplicity  and  elegance  of  this  method  is  lost  in  two  and  higher  dimensions. 

The  ME  spectrum  parameters  for  the  2-D  case  are  in  general  different  from  the  those 
for  spectral  estimation  by  the  AR  technique.  Maximum  entropy  requires  the  solution  of  a 
nonlinear  optimization  problem  (McClellan,  1982),  and  there  is  no  known  closed  form 
solution.  The  method  requires  one  to  maximize  the  entropy  subject  to  a  correlation 
matching  constraint  Malik  and  Lim  (1981)  suggested  an  iterative  algorithm  for  solving 
the  2-D  ME  problem. 

A  more  fundamental  problem  for  the  ME  method  is  that  the  ME  spectral  estimate 
may  not  exist.  Woods  (1972,1976)  has  shown  the  existence  and  uniqueness  of  a  2-D 
discrete  Markov  random  field  which  agrees  with  given  correlation  values  on  an  array  of 
nearest  neighbors.  The  resulting  power  spectrum  estimate  is  the  2-D  ME  spectrum.  He 


suggested  an  iterative  algorithm  to  evaluate  the  Markov  spectral  estimation  for  a  regu¬ 
larly  spaced  array.  However  Dickinson  (1980)  pointed  out  that  the  2-D  Markov  spec¬ 
trum  estimates  based  on  estimated  autocorrelation  data  may  not  always  exist.  Thus 
Woods  results  apply  only  when  existence  of  the  estimate  is  known. 

A  power  spectrum  estimate  of  a  2-D  random  process  can  also  be  developed  using  an 
AR  model.  In  this  case  the  random  process  x(nj,n2)  is  represented  by  an  AR  model 
defined  over  some  region  a. 

*(*1*2)=“  X  (#t  i~i  1  >n  2~i  2)  +  *'(f*1>'i2) 

(«ltij)€a  (2.11) 

(•l.ijMO.O't 

Where  an  are  the  filter  coefficients  and  w  («!,/:  2)  is  2-D  white  noise.  Although  the 
parameters  of  this  model  are  most  often  generated  by  considering  the  2-D  linear  predic¬ 
tion  problem  with  region  of  support  a  and  formulating  and  solving  a  set  of  Normal  equa¬ 
tions,  this  method  is  strictly  valid  only  when  an  AR  representation  of  the  2-D  random 
process  is  known  to  exist.  That  is,  the  linear  prediction  model  is  the  same  as  the  AR 
model  only  when  the  process  is  truly  described  by  a  model  with  the  postulated  form  of 
support.  In  this  case  the  error  produced  by  the  linear  prediction  is  white  and  the  original 
random  process  can  be  generated  by  inverse  filtering. 

The  linear  prediction  model  is  sometimes  refered  to  as  the  minimum  mean-square 
error  model  (Jain,1981).  It  differs  in  general  from  the  AR  or  white  noise  driven  model  in 
that  the  error  residuals  are  not  guaranteed  to  be  white. 

In  practice,  when  a  NSHP  model  of  sufficiently  large  size  is  used,  the  error  residuals 
are  found  to  be  nearly  uncorrelated  so  that  the  distinction  between  the  two  types  of 
models  is  not  so  important.  In  general,  this  will  not  be  true  for  models  with  quadrant 
support  and  so  single  quadrant  AR  models  (as  will  be  seen)  tend  to  produce  poor  spectral 
estimates.  Nevertheless,  combinations  of  the  quadrant-based  models,  while  lacking  a 
theoretical  justification,  tend  to  produce  reasonably  well  behaved  spectral  estimates. 

In  the  NSHP  model  the  linear  prediction  model  and  the  AR  model  become  identical 
when  we  have  an  infinite  region  of  support  (Ekstrom  and  Woods,1976;  Jain, 1981;  Mar- 
zetta,  1978, 1979, 1980).  In  this  case  the  spectrum  of  the  process  is  given  by 


where 
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(2.13) 


and  a£2  is  the  prediction  error  variance.  Since  infinite  NSHP  support  can  not  be  used  in 
practical  applications  formula  (2.12)  is  used  with  a  finite  region  a.  Experimental  results 
show  that  NSHP  support  of  modest  size  is  adequate  to  obtain  a  good  spectral  estimate. 

Although  models  with  support  in  only  a  single  quadrant  are  seldom  adequate  to 
represent  a  general  2-D  random  process,  it  has  been  mentioned  that  quadrant  support  is 
useful  for  spectrum  estimation  if  two  or  more  quadrant  models  are  combined.  The  two 
types  of  AR  models,  QP  and  NSHP,  are  discussed  in  more  detail  below. 

uadrant  Plane  (OP)  Model 


Generally  the  region  a  of  an  AR  model  can  take  many  different  forms.  Let  a  be 
the  rectangular  region  shown  in  Fig.  2.1.  In  this  case  the  region  a  consists  of  PxxP2 
points  and  L  x  and  L2  are  chosen  to  be  within  the  range  -P  X<LX<D,  ~P  2<L  2-0- 

First  quadrant  support  can  be  considered  as  a  special  case  of  rectangular  support 
where  L  1=L2=0.  In  this  case  the  region  of  support  a  for  the  first  quadrant  plane  will  be 

^(VA-l  x  (XPj-l]  •  In  this  thesis  we  define  an  N,h  order  first  quadrant  plane  model  to 

be  one  with  P  x-\=P2-l=N.  When  P  t  and  P2  are  not  equal  we  can  refer  to  the  order  as 
{Px-\J>2-\). 

Now  letx(n!,n2)  represent  a  zero-mean  stationary  2-D  random  signal.  Then  the 
estimated  value  x(nx,n2 )  using  first  quadrant  linear  prediction  model  can  be  written  in 
the  following  form  : 


Pi-iPj-i 

i,/t2)=-Z  i-i\,n2-ii)  ,  (ix,i £*(0,0) 

« j=0  j'i=0 

If  we  define  the  matri;.  2-D  correlation  function  as 


(2.14) 
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and  where 


ao,o  -  1 


G\  =  E  (x(nl,n 2)~x  (nl,n2))2 


(2. 1 8d) 
(2. 1 8e) 


Note  that  a2  is  the  prediction  error  variance.  The  spectral  estimate  yielded  by  the  first 
QP  model  is  given  by: 


S1(o)1,co2)  = 


l//1(co1,co2)  I 


(2.19) 


where 


/Vi  /Vi 

H\((ol,a>2)=  X  X  .  oM=l 

/l  1=0  «2=0 


(2.20) 


Naturally,  it  is  possible  to  develop  a  model  assuming  other  regions  of  support  for  the 
filter  coefficients.  If  we  set  up  the  problem  assuming  that  the  second  quadrant  is  our 
region  of  support,  we  will  get  a  different  answer.  Specifically,  in  terms  of  Fig.  2.1,  let 
Ll=~Pl  and  L2-  0.  Then  the  second  quadrant  prediction  equation  can  be  written  as 


/Vi  /Vi 

x(n1,n2)  =  -£  L  bjj^(nl+j1,n2-j2)  ,  0'j ,j2)^(0,0) 
j  i=o  j  2=0 


(2.21) 


and  the  Normal  equations  take  the  form  : 


R  .  B  =  S, 


(2.22) 


where 
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and  where  a2  is  the  prediction  error  variance  of  the  second  quadrant  model 
The  resulting  spectral  estimate  of  the  second  quadrant  is 


52(co1,o>2) 


°2 


l//2(0)1,CCh)l2 


where 


>  ^00-l 


(2.25) 
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"2(® i.«2)=Z  E  bninieJ(Shn'e~J<°>ni 

ni=0  «2=0 

and  is  not  equal  to  the  spectral  estimate  of  the  first  quadrant  S^co^o^).  Jackson  and 
Chien  (1979)  observed  that,  in  typical  cases  of  estimating  sinusoids  in  noise  the  spectral 
estimates  of  each  quadrant  were  skewed.  This  is  not  surprising  since  the  prediction 
geometry  of  quadrant  plane  causal  models  is  too  restrictive.  To  remove  the  skewness, 
they  suggested  designing  two  quadrant  plane  models  with  support  on  different  quadrants 
and  combining  them.  Their  combined  estimate  can  be  written  as  : 


S  !(<£>! ,£02)  S2(C01,C02) 

and  tends  to  produce  a  more  symmetric  spectral  estimate.  The  specific  form  (2.26)  is 
important,  since  Jackson  and  Chien  showed  that  other  symmetric  combinations  of  S  i  and 
S2  did  not  have  desirable  properties.  We  will  call  this  mode]  a  combined  quadrant  (CQ) 
model. 

2.  Nonsymmetric  Half  Plane  (NSHP)  Model 


The  NSHP  model  is  a  very  important  form,  since  it  is  always  possible  to  factor 
an  arbitrary  2-D  spectral  density  into  factors  with  infinite  extent  NSHP  support.  While  it 
is  not  practical  in  spectral  modeling  to  use  very  large  support  regions,  NSHP  support  of 
modest  size  has  been  found  to  give  reasonable  results.  The  infinite  region  of  support  a 
for  the  general  NSHP  linear  prediction  model  (also  called  a  causal  model)  is  shown  in 
Fig.  2.2a. 

We  define  an  N‘h  order  NSHP  model  to  be  one  with  N  points  above,  below,  and 
to  the  right  of  the  reffence  point  (i,j)  =  (0,0).  That  is,  P  [-1  =P2-1  =N  as  shown  in  Fig¬ 
ure  2.2b.  When  Px  *P2  we  may  also  refer  to  the  order  of  the  filter  as  (Pj-l/^-l) 
order. 

Now  we  consider  the  problem  of  finding  a  NSHP  linear  prediction  model  of 
finite  order.  Let  .r  (nl,n2)  represent  a  stationary  zero-mean  random  process.  The  linear 
prediction  equation  takes  the  form  : 


(b)  Region  of  finite  support 


2.2  NSHP  region  of  support 


’R(kJ> 2-1)  R(kf2-2)  .  .  .  R(k,-P2+ 1) 
R  (kj3  2)  R(k, 0)  .  .  .  /?(£,-/>  2+2) 

R,(/k)  =  R/(-*)  = 

/?  (^^P2-l)  R(kZP2-2)  .  .  .  R  (k  ,0) 


■tf(0,0)  R( 0,-1)  .  .  .  R(0-P2+iy 
R( 0,1)  fl(0,0)  .  .  .  /?(()-/>  2+2) 

R,(0)  = 

/?  (0^2-1)  R  (0^2-2)  •  •  •  *(0,0) 


and  where 


Note  that  R  (kiJc^)  is  the  2-D  autocorrelation  function  for  the  data  defined  by  Eqn.  (2.15) 
and  a£2  is  the  prediction  error  variance.  The  spectral  estimate  resulting  from  the  NSHP 
model  is  given  by: 


where 


Siz^z  2) 


l//(zi,Z2)l2 


(2.31) 


//(z1^2)  =  1+2  a0i2-z2il+  Z  2  z2i2  (2.32) 

»2=1  <i=l  *2=-(^r-l) 

It  is  of  interest  to  study  the  behaviour  of  the  AR  spectral  estimation  algorithms, 
and  compare  the  results  with  those  of  Fourier  transform  techniques.  A  sinusoidal  signal 
plus  white  noise  w(nvn^)  has  been  investigated  as  a  test  example.  Assume  our  process 
is : 


L 

x(n1,/i2)=  2cos(niC0/1+n2C0;2)  +  *'('ii,'i2)  (2.33) 

i=l 

where  w(nl,n2),  is  a  zero  mean  independent  white  noise  signal,  and  L  represents  the 
number  of  sinusoids.  Estimates  were  computed  for  a  dataset  size  of  16x16  with  L=2, 

oin=oji2=-^-,  and  co21=tt)22:='T'  Fi§ure  2.3  shows  the  NSHP  and  CQ  spectral  estimates. 

The  location  of  the  peaks  are  in  the  correct  place  and  there  is  good  resolution.  Fig.  2.4 
shows  the  result  of  power  spectral  estimation  on  the  same  data  applying  both  correlelo- 
gram  and  periodogram  techniques.  Comparison  of  Figs.  2.3  and  2.4,  shows  that  the 
NSHP  and  CQ  models  have  significantly  higher  resolution  than  the  Fourier  transform 
techniques. 


D.  LATTICE  STRUCTURE 

The  key  step  in  parametric  spectral  estimation  is  the  computation  of  the  parameters 
of  the  model.  Least  square  estimation  of  model  parameters  (in  the  AR  case)  is  one 
approach  and  requires  the  solution  of  a  set  of  linear  equations  involving  the  sample 
covariance  matrix.  Another  approach  that  has  many  advantages  in  1-D  is  to  represent  the 
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(a)  NSHP  (b)  QP 
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model  in  terms  of  a  finite  number  of  reflection  coefficients  which  are  chosen  to  obtain  a 
good  fit  between  the  whitening  filter  model  and  the  data.  The  realization  of  the  filter  is 
then  in  the  form  of  a  lattice  structure. 

In  2-D  there  has  been  no  direct  counterpart  to  the  lattice  structure  of  the  prediction 
filter  that  exists  in  1-D.  Marzetta  (1978),  however  developed  a  set  of  2-D  reflection 
coefficients.  In  his  approach  the  available  data  is  used  to  estimate  the  2-D  autocorrela¬ 
tion  function  to  a  finite  lag,  and  then  the  estimated  autocorrelation  function  is  used 
to  compute  the  reflection  coefficients  which  determine  the  whitening  filter  H(zl,z2). 
This  is  similar  to  what  is  done  in  1-D. 

Given  a  lattice  structure  in  2-D,  another  approach  for  choosing  the  reflection 
coefficients  is  analogous  to  the  Burg  algorithm.  Estimates  of  the  filter  parameters  as 
obtained  directly  from  the  data,  and  the  lattice  structure  can  be  formed  from  the  forward 
and  backward  predictions.  Fig.  2.5  depicts  L  lattice  sections  connected  in  cascade.  The 
input  to  the  system  is  x(n  x,n2)  and  the  two  outputs  equal  forward  and  backward  predic¬ 
tion  errors  at  step  L.  The  initial  values  of  the  forward  and  backward  prediction  errors  of 
model  order  (0,0)  are  given  by: 


x{nx,n2)  =  e  (0'0)(n  x,n2)  =  e  *m(n  j  ,n  2) 


(2.34) 


Parker  and  Kayran  (1984)  developed  a  different  kind  of  lattice  representation  in  2-D. 
Their  initial  development  was  done  for  quadrant-based  models  but  since  their  representa¬ 
tion  is  neither  a  true  whitening  filter  nor  an  optimal  least  squares  prediction  filter,  we  will 
not  put  .ue  it  further  here.  Marzetta’s  method  is  outlined  briefly  below. 

1.  2-D  Levinson  Aleorithm  (Marzetta 


Marzetta  proposed  a  2-D  Levinson  algorithm  for  the  2-D  case  which  has  the 
same  structure  as  1-D  Levinson  recursion.  Suppose  that  we  have  a  positive  definite  auto¬ 


correlation  sequence  :  |/?  (&,/);  (0,0)<(£ ,/  )<(N  M ) j  and  assume  that  we  know  the  solu 

tion  to  the  Normal  equations  for  :  2)  and  the  optimum  mean  square  error 

p(n/n-\)  for  orcjer  (njxi-1).  Then  the  solution  for  H(njn\zx,z2)  and  are  given 

by  (Marzetta,  1978, 1979, 1980): 


and : 


W2)  =  //(',’'n~1W2)  -  K(n’m\z{n.ztmJi(njn-1\z{1  ,z  il )  (2.35) 


p(njn)  —  p{njn- 1) 


fc(njn) 


2 1 


(2.36) 


where  AT(n''n);(/i=0,  l<m£A/),  (l£/iS/V,  is  the  2-D  reflection  coefficient 

sequence,  and  where  N  and  M  are  positive  integers  that  represent  the  order  of  the  filter. 
The  2-D  reflection  coefficients  are  defined  by 


g(njn) 


1 

p(njn- 1) 


E 


xin^nj-  l\kxJ(.1).x{nl-kx,n1-k1) 


x{nx-n,n2-m)  -  '£'£a(n-m  l\kuk2)  x(nx-n+kl,n2-m+k7) 

(k„*  2) 


(n  )  -  S Ia(n’m-1)(* ^2)  /?  ( n -k  ,/n-kJ  (2  37) 

The  idea  of  a  2-D  Levinson  algorithm  is  to  get  the  prediction  error  filter  H{NM\zx,z 2) 
from  these  reflection  coefficients  by  recursively  computing  a  sequence  of  2-D  finite  order 
prediction  error  filters,  H(n'm\z  Xyz2). 

Marzetta  (1978)  defined  a  2-D  stable,  linear,  shift-invariant  filter  to  be  one 
whose  unit  sample  response  is  absolutely  summable  and  defined  a  2-D  minimum  phase 
filter  to  be  a  2-D,  causal,  stable,  linear  shift-invariant  filter  which  has  a  causal,  stable 
inverse.  In  this  case  he  proved  that  if  H^n/n~l\zx,z2 )  is  analytic  minimum  phase,  then 
H(n’m\zx,Z2)  as  defined  by  Eqn.  (2.35)  is  also  analytic  minimum  phase.  He  defined  a 
2-D  filter  to  be  analytic  minimum  phase  if  the  filter  is  minimum  phase  and  also  analytic 
in  some  neighborhood  of  the  unit  bicircle. 


1 

p(njn- 1) 


2.  Marzetta  Approach  and  Burg  Aleorithm 


His  algorithm  is  simply  the  2-D  Levinson  algorithm,  used  under  the  assumption  that  the 


true  reflection  coefficient  sequence  K^njn\  vanishes  for  :  •!  0<n<A/-l,  I m  \>M 


and 


n  =N ,  m  <-M 


[■ .  He  showed  that  to  preserve  the  minimum  phase  property  required  an 


infinite  set  of  reflection  coefficients  defined  on  the  "  continuous  support  "  region  shown 


in  Figure  2.6.  Let  ■< 


a(nl,n^),0<n1<N  ,0<ji2^M  ,(ni,n2)^{0,0)y  be  a  set  of  2-D  FIR 


filter  coefficients,  for  a  NSHP  filter  with  finite  support.  Then 


H<NMXzl,z7)  =  1-  2  2  a(n1 

n  i=0n2=0 


,n2)z^zinl 


(nvn2  M0.0)  (2.38) 


Marzetta  showed  that  although  a  finite-extent  sequence  < 


a(nun2) 


results  in  an  infinite 


sequence  of  reflection  coefficients,  fortunately  the  converse  of  this  result  is  not  true.  If  a 
polynomial  is  defined  by  a  finite  sequence  of  reflection  coefficients  that  polyno¬ 

mial  will  still  have  finite  degree.  Furthermore,  if  \K{'njn)\  <1  for  all  (n,m),  that  polyno¬ 
mial  will  have  minimum  phase.  The  polynomial  can  be  found  from  the  reflection 
coefficients  K(njn)  by  means  of  the  following  recursion:  Starting  with  the  n=0  column, 
and  with  the  initial  condition: 


//(°’°)(z1^2)  =  1  (2.39) 

then  H{n’m\zl,z1)  for  (1  <m<M)  can  be  written  as 

H(n-m\zl,z2)=H{n'm-l\zl,z2)  -  K^n/n)  zfn  z2"m//(n'm-1)(z1-1  ,z2l )  (2.40) 

Then  shifting  to  the  next  column  i.e.  (n  >0,  - M<m<M )  and  starting  with  the  boundary 
condition: 


#<«.-(W+i»(z  Z2)  =  H{n-'lM\zl,z2) 


(2.41) 


this  recursion  is  repeated  for  all  values  of  (n,m)  for  which  K('n'm)±Q.  It  can  be  shown 
(Marzetta,1978)  that  the  value  of  K('n/n^  which  minimizes  the  new  prediction  error, 
P(n>m)is: 


rinsn)  _  1 


R  ( n  ,m  )-£  £/i  *  ^n/n~x\kxJcq)Ji  ( n  -k  i/n-kj) 
(M2) 


where  : 


//2(^-l>(z1^2)= 1  -  nk2 

(*1^2) 


(2.42) 


(2.43) 


The  new  prediction  error  is  then  : 


p(njn)  _  p(njn- 1) 


j((njn) 

J 


2n 


(2.44) 


It  can  further  be  shown  that  the  magnitude  of  )  is  always  less  than  one. 

Burg  developed  an  algorithm  for  estimating  1-D  prediction  coefficients  directly 
from  the  data  (Burg,  1967, 1968, 1975).  Burg  makes  no  assumption  about  the  univariate 
data  except  that  the  signal  is  treated  as  a  stationary  random  process.  No  estimation  of  the 
autocorrelation  function  is  required  in  this  algorithm. 

Marzetta  made  the  suggestion  to  extend  the  1-D  Burg  algorithm  to  2-D  (details 
are  given  in  Appendix  A  )  but  so  far  no  one  in  the  literature  has  tried  to  test  the  2-D  Burg 
algorithm.  We  have  done  this  however  and  we  will  illustrate  here  an  example  to  study 
the  behaviour  of  the  estimating  algorithm.  In  this  example  the  estimated  power  spectrum 
is  computed  by  combining  forward  and  backward  estimates  in  a  manner  similar  to  the 
combined  quadrant  technique.  Consider  a  signal  consisting  of  sinusoids  in  additive  noise 
as  given  in  Equation  (2.33).  The  results  are  shown  for  the  cases  of  one  sinusoid  and  two 
sinusoids.  In  case  of  one  sinusoid  (L=l)  the  estimated  power  spectrum  is  given  for  a 
dataset  size  of  64x64  and  frequency  values  of  con  =  0)12  =  k/2.  A  third  order  model  is 
used.  Figure  2.7a  shows  the  results  of  the  forward  NSHP  model,  while  the  backward 
NSHP  model  results  are  shown  in  Figure  2.7b.  It  is  clear  that  using  either  the  forward  or 
the  backward  NSHP  models  alone  results  in  a  spreading  of  the  peak  in  one  direction. 
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The  combined  estimate  using  both  forward  and  backward  models  gives  a  more  accurate 
result  as  shown  in  Fig.  2.7c. 

In  the  second  example  the  results  are  shown  for  the  case  of  two-sinusoids  ( L— 2), 
with  (On  =  co12  =  CO22  =  7t/2  and  (D^  =  rc/5.  This  results  are  compared  with  the  results  of 
the  AR  (NSHP)  model  for  the  same  test  case.  Figure  2.8a  shows  the  results  of  the  com¬ 
bined  forward  and  backward  NSHP  Burg  technique,  while  Fig.  2.8b  shows  the  results  of 
the  AR  model.  A  third  order  model  is  used  in  both  cases.  This  example  shows  the  high 
resolution  properties  of  the  AR  relative  to  Burg  technique.  We  have  examined  many  dif¬ 
ferent  examples,  with  different  choices  of  the  peak  positions,  we  observed  that  AR  model 
has  consistently  better  resolution  properties  than  Burg  algorithm. 

E.  2-D  DIRECT  METHOD  OF  SPECTRAL  ESTIMATION 

Therrien  (1981)  pointed  out  that,  there  exists  a  close  relation  between  2-D  linear 
prediction  problems  and  multichannel  1-D  linear  prediction  problems.  He  suggested  a 
procedure  for  solving  2-D  normal  equation  by  relating  the  2-D  linear  prediction  problem 
to  a  multichannel  linear  prediction  problem  and  applying  the  multichannel  Levinson 
recursion.  These  results  can  be  used  to  formulate  a  method  for  estimating  the  2-D  model 
parameters  without  explicit  prior  estimation  of  the  correlation  function  and  we  will  refer 
to  it  as  a  direct  method.  The  procedure  can  be  summarized  briefly  as  follows:  Consider  a 
2-D  stationary  zero-mean  random  process  x(nj,n 2)  defined  over  a  rectangular  support 

region.  The  linear  prediction  equation  is  of  the  form: 

/ 

Pi-l/Vl 

x( L  a/l,Vc (" I- * i’11 2~ 1 2)  .  (M2)*(0.0)  (2.45) 

1 1=0 1 2=0 

The  corresponding  Normal  equation  is  given  by  Eqn.  (2.17)  but  repeated  here  for 


convenience 


(b)  AR  model  method 


where 


[RCPi-l)  R^i-2)  •  •  •  R(0)  j  |aw',-1,J  [  0  J 


and  where 


ak.  o 
ak.i 


akJ>z-l 


S(0)  = 


0 


where  a2  represents  the  variance  of  the  error  resulting  from  optimal  linear 
prediction. 

If  the  data  is  represented  as  a  vector 


x(n  ,0) 
x(n,  1) 


(2.48) 


x(nj>2-  0 


then  the  vector  x„  can  be  considered  as  consisting  of  the  components  of  a  1-D  multichan¬ 
nel  random  process  evolving  in  the  n  direction.  A  corresponding  1-D  P2'  channel  linear 
prediction  problem  can  be  written  as 


Pl_1  f  1 1 

-z  M 

»=i  j 


(2.49) 


where  a(‘  *  are  a  set  of  matrix  coefficients. 


•  •  <*&r\ 
■  ■  «i(^i 


a(i)  = 


(2.50) 


n  /y(«)  ( 

“pr-l.O  apjrU 


V  lJ’z-l 


The  values  of  the  a(‘ )  are  found  by  solving  a  set  of  Normal  equations 


■R(0)  R(-l) 

R(l)  R(0) 


R(-/>i+l)'|  Tj 
R(-/*i+2)  aO) 


(2.51) 


R^t-l)  R(P  i~2)  .  .  .  R(0)  J 

where  Ep  is  the  error  covariance  of  the  optimal  estimate. 


£/>,  =  ^[(V -xn)(xn-xjr 


(2.52) 


By  comparing  Equation  (2.46)  and  Equation  (2.51)  we  note  that  if  we  define  S(0)  such 


S(0)  =  EP  A(0) 


(2.53a) 


r's-'is.-r: 


V.v‘.n 


then  the  2-D  coefficients  can  be  computed  from 


A(,)  =  a(i)  A(0)  ,  i=l,2,  •••  Px-\  (2.53b) 

The  multichannel  Levinson  recursion  (  see  Appendix  B  )  can  be  used  to  solve  for  the  a(i ) 
and  EPi.  Then  Eqn.  (2.53)  can  be  used  to  solve  for  A(,)  and  a2.  That  is,  one  first  solves 

Eqn.  (2.53a)  for  A(0)  and  S(0\  then  applies  Eqn.  (2.53b)  to  compute  the  remaining  A(,). 
The  multichannel  Levinson  recursion  computes  both  the  calculation  of  the  forward  and 
backward  multichannel  linear  prediction  parameters  simultaneously.  The  backward 
parameters  relate  to  the  second  quadrant  2-D  filter  parameters  in  a  similar  manner  and 
thus  can  be  computed  from  equations  analogous  to  Eqn.  (2.53).  The  parameters  of  the 
first  and  third  quadrant  filters  are  identical.  The  second  and  fourth  filter  parameters  are 
also  identical. 

Now  instead  of  using  the  covariance  matrix  in  the  multichannel  Levinson  recursion 
to  solve  for  the  and  EPi,  we  apply  the  multichannel  form  of  the  Burg  technique  to 
calculate  the  parameters  direcdy  from  the  given  data.  Explicit  forms  of  the  Burg  method 
for  the  1-D  multichannel  case  were  developed  separately  by  Nuttall  (1976)  and  Strand 
(1977)  and  the  procedure  is  sometimes  called  the  Nuttal-Strand  algorithm  (Appendix  C). 
Then  application  of  Eqn.  (2.53)  gives  a  method  for  estimating  the  2-D  parameters 
directly  from  the  data. 

The  power  spectral  estimation  results  of  this  technique  are  compared  with  the  results 
of  estimation  by  first  estimating  the  correlation  function.  Our  method  for  estimating  the 
model  parameters  is  as  follows.  Suppose  we  are  given  an  array  of  2-D  data  of  size  Nx  - 
columns  and  N2  -  rows  as  shown  in  Fig.  2.9a.  Suppose  this  array  is  divided  into  L- 
sectors,  each  sector  is  a  subarray  of  size  M  by  Nj  ( M  =  Pj),  as  shown  in  Fig.  2.9b.  The 
2-D  data  set  x(nx,n  2)  is  mapped  into  a  1-D  M-channel  data  vector  \{k )  by  concatenating 
the  sectors  as  shown  in  Fig.  2.9c. 


Discontinuities  of  course  exist  at  the  points  of  concatenation  but  in  practice  these  are  not 
troublesome.  By  applying  the  multichannel  Burg  algorithm  (Appendix  C)  to  this  1-D 
multichannel  data  set  we  compute  the  estimated  values  of  the  filter  coefficients  a(‘  *  and 
also  the  error  covariance  matrix  Ep(.  Then  the  2-D  parameters  (  A(,)  and  o2  )  can  be 
obtained  from  Equations  (2.53a)  and  (2.53b). 

Since  the  method  estimates  both  the  forward  and  backward  multichannel  parameters 
we  have  the  2-D  filter  coefficients  for  first  and  second  quadrant  model  and  we  can  finally 
estimate  these  power  spectra  as 


51(o)1,co2)  = 


\H  ,(co1,o>2)  I 


(2.55a) 


5  2«a1,o)2)  =  — - -j 

l//2(C01,C02)  l 


(2.55b) 


where  the  parameters  are  defined  as  in  section  C.  Then  the  CQ  power  spectrum  estimate 
can  be  obtained  by  using  Equation  (2.26).  We  called  this  method  the  direct  CQ  method. 

It  is  of  interest  to  study  the  behaviour  of  the  CQ  algorithm  when  the  model  parame¬ 
ters  are  obtained  by  estimating  the  correlation  function  and  solving  the  Normal  equa¬ 
tions,  and  compare  the  results  with  the  case  where  the  parameters  are  obtained  directly 
from  the  given  data  by  using  the  new  algorithm.  The  performance  of  the  two  algorithms 
for  two  sinusoids  in  white  noise  has  been  investigated.  Suppose  a  2-D  random  process 
x(rt\,n 2)  is  defined  by 


i  v  W  -k  *  s 


x(nx,ri2)=  X  ci  cos  (n\0>il  +  n2(OiJ  +  w(nl,n-2) 
i=l 


(2.56) 


A  dataset  of  (64x64)  data  points  is  considered  here,  with  the  parameters  con  =  co12  = 

CO21  =  CO22  =  y ,  and  Cj  =  C2  =  4.  Two  methods  are  used  to  compute  the  power  spectral 

estimation.  In  the  first  case  the  1-D  multichannel  filter  parameters  are  computed  from  the 
prior  estimated  autocorrelation  function,  while  in  the  second  case  the  parameters  are 
computed  directly  from  the  data  ''CQ  direct  method). 

The  effect  of  model  order  is  also  illustrated  here,  by  considering  three  different  filter 
orders.  First  order  model  results  are  shown  in  Fig.  2.10.  In  both  techniques  the  resolu¬ 
tion  of  the  first  order  model  is  poor  and  only  one  peak  appears.  Fig.  2.11  shows  the 
results  of  computed  spectra  for  the  second  order  model;  the  resolution  in  this  case 
increases  and  the  two  peaks  appear.  Finally  Fig.  2.12  shows  the  results  of  power  spectral 
estimation  using  third  order  models.  It  is  clear  that  the  resolution  is  high  and  the  peaks 
are  very  close  to  the  correct  peak  locations.  Comparing  the  results  of  the  two  techniques, 
we  find  that  somewhat  better  resolution  results  are  obtained  for  direct  CQ  technique.  It 
will  be  seen  later  that  this  is  consistently  true  for  the  direct  method.  Also  we  observe 
that,  generally  for  both  techniques,  the  resolution  increases  with  increasing  order  of  the 
filter. 

F.  DISTANCE  MEASURES  BETWEEN  RANDOM  PROCESSES 

In  problems  of  communication  and  radar,  the  optimum  signals  (for  purposes  of 
detection)  are  those  that  minimize  the  probability  of  error.  However  in  many  cases,  direct 
minimization  of  the  probability  of  error  to  determine  an  optimum  signal  set  is  impossi¬ 
ble.  Since  an  explicit  analytical  expression  for  the  error  probability  may  be  too  difficult 
to  find  it  is  useful  to  search  for  signal  selection  criteria  that  although  weaker  than  the 
error  probability  are  easier  to  evaluate  and  manipulate.  In  the  search  for  suitable  criteria, 
the  notion  of  a  distance  between  two  probability  distributions  is  quite  useful.  One  such 


measure  is  the  statistical  divergence.  (Grettenberg,1963;  Hartwig,1972;  Hingorani  and 
Hancock,1965;  Kailath,l%7;  Kazakos  and  Papantoni-Kazakos,1980;  Marill  and 
Green,  1963;  Schweppe,1967) 

Let  f  i(x)  and  / 2(x)  represent  the  probability  density  functions  of  two  random 
processes  that  we  wish  to  discriminate  by  means  of  a  statistical  hypothesis  test.  By 
defining  the  logarithm  of  the  likelihood  ratio  A1(2  as 


Ai,2  =  log 


/  lOO 

fib) 


(2.57) 


then  the  mean  information  1(1,2)  is  defined  as; 


/(U) 


=  £,[au] 


(2.58) 


where  Ex[ .  ]  represents  the  expectation  under  the  distribution  f  v  The  divergence  was 
first  introduced  by  Jeffreys  (1946).  It  is  defined  as  the  difference  in  the  mean  values  of 
the  log  likelihood  ratio  under  the  two  hypotheses  Hx  and  H2.  That  is 


where; 


J  (1,2)  =  /  (1,2)  +  /  (2,1) 


/<2,1)  =  -E2[au] 


(2.59) 


(2.60) 


and  E  2[ .  ]  denotes  the  expectation  under  /  2. 

Since  spectral  estimation  involves  building  a  model  for  the  random  process  under 
consideration,  it  is  reasonable  to  use  the  divergence  measurement  to  compare  the  spectral 
estimates.  The  divergence  in  this  case  measures  the  ability  to  discriminate  between  the 
random  processes  produced  by  two  alternative  models.  Therefore  it  measures  the  close¬ 
ness  of  the  distributions  of  the  random  processes  and  the  closeness  of  two  corresponding 
spectral  estimates. 

Since  J  increases  with  each  additional  sample  of  the  random  process  one  needs  to 
specify  the  number  of  observations  of  the  random  process  to  compute  the  divergence. 
Alternatively  one  can  consider  the  change  in  divergence  that  occurs  with  each  new 
observation.  For  a  homogeneous  process  with  AR  model  this  increase  in  divergence  is 
constant  and  is  called  the  "incremental  divergence"  A/. 


In  the  1-D  multichannel  case,  the  incremental  divergence  A J  (1,2),  for  a  stationary 
Gaussain  random  AR  processes  with  zero  mean  can  be  expressed  (Fishman,  Jones,  and 
Therrien,  1981,1983)  either  in  terms  of  the  filter  parameters,  or  in  terms  of  the  estimated 
power  spectra.  The  results  for  1-D  random  processes  can  be  straightforwardly  extended 
to  the  2-D  case.  The  results  can  be  summarized  as  follows: 

1.  In  terms  of  filter  parameters: 

AJ (1,2)  =  ^  tr  A2  Rt  Aj  (of)-1  +  tr  At  R2  Af  (Oj2)'1  -  1  (2.61) 

where:  Ak  are  the  set  of  filter  coefficients  of  the  process  k,  (k=l,2).  Rk  represents  the 
entire  covariance  matrix  of  the  process  k,  and  a k  is  the  corresponding  error  covariance  of 
the  process  k  (equations  for  Ak,  Rfc,  and  ak  are  given  in  section  C  and  are  defined  by 
Eqns.  (2.16-2.18)). 

2.  In  terms  of  spectral  estimates: 


1  *  5j( o>)  *  S2(g>) 

AJ (1,2)  =  ——  f— — d(0+  da  -1 

8tc2  452(co)  4  5 ^cu) 


(2.62) 


where  Sk  (co)  denotes  the  power  spectral  of  the  process  k. 

In  this  section,  we  illustrate  two  different  examples  to  measure  the  incremental 
divergence  between  two  random  processes.  In  the  first  example  the  incremental  diver¬ 
gence  was  measured  by  estimating  the  filter  parameters  while  in  the  second  example  the 
measurement  was  computed  from  the  estimated  power  spectra. 

Example  (1): 

In  this  example  we  measure  the  incremental  divergence  between  the  first  and  second 
quadrant  models  as  a  function  of  model  order,  calculated  from  the  filter  parameters. 

Consider  a  signal  consisting  of  two-sinusoids  in  additive  noise,  as  given  in  Equation 
(2.56),  with  the  same  parameters,  except  C,  =  C2=  1.  The  incremental  divergence  in  this 


example  is  calculated  from  Eqn.  (2.61).  Fig.  2.13  shows  the  incremental  divergence 
between  the  first  and  second  quadrant  for  different  values  of  the  model  order.  The  value 
of  incremental  divergence  at  first  decreases  with  increasing  model  order.  This  implies 
that  the  first  and  second  quadrants  spectral  estimates  tend  to  get  closer  to  each  other  as 
the  model  order  increases.  Beyond  order  4  the  divergence  increases  then  decreases 
again.  Since  the  quadrant  models  are  not  sufficient  for  modeling  the  random  process,  this 
increase  in  divergence  with  order  is  not  necessarily  unexpected. 

Example  (2): 

In  this  example  we  make  a  comparison  between  the  nonsymmetric  half  plane  and  the 
quadrant-based  models.  Also  we  measure  the  incremental  divergence  between  quadrant 
plane  AR  models  and  CQ  model. 

Consider  the  previous  example  with  the  same  parameters.  The  comparisons  are 
made  for  these  models  by  numerically  integrating  the  computed  power  spectral  estimates 
to  obtain  the  incremental  divergence  for  different  model  orders.  Table  2.1a  shows  the 
results  of  comparing  NSHP  spectral  estimate  with  the  quadrant- based  model  estimates, 
while  Table  2.1b  shows  the  results  of  comparisons  between  the  CQ  model  and  first  and 
second  quadrant  plane  models.  The  tables  shows  closeness  between  NSHP  model  and 
the  CQ  model,  as  we  expect,  while  the  difference  between  the  NSHP  and  the  individual 
quadrant  plane  models  is  high.  Also  the  tables  indicates  that  in  general  the  spectral  esti¬ 
mates  tend  to  be  closer  as  the  model  order  increases. 
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TABLE  2.1  INCREMENTAL  DIVERGENCE  MEASUREMENT 


(a)  Quadrant  vs  NSHP  models 


Incremental  Divergence  (A/) 

Method 

Model  Order 

1 

2 

3 

1"  QP 

0.6827 

0.4054 

0.4004 

2nd  QP 

0.7171 

0.5332 

0.5287 

CQ 

0.1248 

0.0373 

. 

0.0348 

(b)  Quadrant  vs  CQ  models 


Incremental  Divergence  (A J) 

Method 

Model  Order 

1 

2 

3 

Ur  QP 

1.1574 

0.5188 

0.4964 

2nd  QP 

1.1474 

0.5099 

0.5043 

t 
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HI.  MULTICHANNEL  2-D  SIGNAL  MODELS 
FOR  SPECTRAL  ANALYSIS 


A.  FORM  OF  THE  DATA 

Multichannel  2-D  signals  will  be  represented  by  a  vector  valued  quantity  such  as 
x(nltn 2)  where  nj  and  are  integer- valued  time  or  space  indices  and  the  dimension  M 
of  x  is  equal  to  the  number  of  channels.  The  notation  x(n  [,/isj  used  to  represent  mul¬ 
tichannel  2-D  signals  may  refer  either  to  th e  function  x  or  to  the  value  of  the  function  x  at 
a  specific  point  (nl,n2)  for  all  the  given  channels.  The  meaning  will  normally  be  clear 
from  the  context.  An  example  of  x(n  ,,n  2)  is  sketched  in  Fig.  3. 1 .  which  shows  that  mul¬ 
tichannel  2-D  signals  can  be  viewed  as  M-planes  of  correlated  2-D  data 

[X  :(n  t.n-o  ] 

I 

!  1  .rt;> 
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Each  plane  xk  1  n  .  .n  2  >  1  k  =  1.2.  .  M  1  of  the  vector  represents  .1  2  1)  si  gnat  existing  :n 

one  of  the  channels 

Suppose  that  the  multichannel  2  D  signat  ;s  represented  'ver  a  hrute  re*,  targmar 
region  of  support  0<n  <S  and  1  >0/1 -«\\  •  It  the  vet tor  v.uued  signals  1:  pm-' 

•n  .n  <  •  are  organized  first  .  '•>•*  •hen  "  •  .:ir.  •  t  irg-r  .»•.  n  t  t . . 
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where 
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x(«!,  1) 


x(«i,JV2-l) 


This  idea  is  illustrated  in  Fig.  3.2  for  the  case  M=2. 


(3.2a) 


(3.2b) 


B.  ESTIMATION  OF  THE  CORRELATION  FUNCTION 

Let  x(n1,n2)  be  a  multichannel  2~D  stationary  random  signal.  Thus  the  mean  is  a 
vector  quantity  independent  of  n  l  and  n  2  and  defined  by 

=£^x(n1,n2)]  (3.3) 

and  the  correlation  and  covariance  are  MxM  matrix  functions  defined  (respectively)  by 


and 


R(kl,k1)  =  E 


(x(n  i,n 2)  xr(nl-kl,n2-k 2) 


(3.4a) 


C(kx,k2)  =  E 


x(n  j-*!, n2- 


T 1 


(3.4b) 


From  their  definitions  it  is  clear  that  the  correlation  and  covariance  functions  have  the 
following  symmetry  properties 
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(3.5a) 


C(k1JcJ  =  CT(-kl,-k2)  (3.5b) 

Spectral  estimation  techniques  are  based  on  the  assumption  that  the  correlation 
sequence  is  available.  Therefore,  to  apply  these  techniques  it  is  necessary  to  estimate  the 
correlation  from  the  available  data.  The  spectral  estimate  can  differ  significantly, 
depending  on  the  method  by  which  the  correlation  sequence  is  estimated. 

Suppose  we  are  given  (A^jV 2)  multichannel  2-D  data  points  and  our  main  interest  is 
in  estimating  the  autocorrelation  function.  As  a  preliminary  step,  we  must  first  estimate 
the  mean  m*.  The  usual  estimate,  namely  the  sample  mean, 

a6) 

provides  an  unbiased  and  consistent  estimate  of  mx. 

Now  assume  we  have  a  stationary  multichannel  2-D  random  process  with  zero  mean; 
i.e., 

mx=0  (3.7) 

then  the  autocovariance  sequence  is 


C  (k{Jc2)  =  E 


x(nx,n2)  xT(nx-kx,n2-k2) 


(3.8) 


which  is  also  equal  to  the  correlation  sequence  R(kx,k2).  One  estimate  for  the  autocorre¬ 
lation  sequence  is 


R(kx,k2)  = 


(Nx-\kx\)(N2-\k2\) 


£  £  x(n  x.n2)xT (n  1  k  x  I  .n  2-  I  k ; 


(3.9) 

where  I  k  ]  I  <N  x  and  \k2  \  <N 2.  It  can  easily  be  shown  that  Rik  x£  is  an  unbiased  esti¬ 
mate  of  R (kx,k2).  An  alternate  estimate  for  the  autocorrelation  sequence  is 


I 


R'(*l,*2)  = 


.  N\-lki  I— LA^r- 1 Jfcal— 1 

-  ■--  2  L  x(/Il,/i2)x7'(ni-l^iU2-l*2|)  (3.10) 

”1”2  /ij=0  «2=0 


which  differs  from  Eq.  (3.9)  only  in  the  multiplying  factor  in  the  front  of  the  sum.  Com¬ 
paring  Eqns.  (3.9)  and  (3.10)  we  find  that 


Ni-I*,l  N2-\k2\  * 

*(M2>=  „  WMj) 


(3.11) 


Consequently,  R^,^)  is  a  biased  estimate  of  the  autocorrelation  sequence,  although  it 
is  asymptotically  unbiased.  The  Fourier  transform  of  the  biased  estimate  is  guaranteed  to 
be  non-negative  definite  but  that  of  the  unbiased  estimate  is  not;  for  this  reason  the  biased 
estimator  is  used  more  often. 

In  our  work,  we  are  dealing  with  statistical  properties  of  multichannel  2-D  random 
signals.  The  signal  vectors  defined  in  Eqn.  (3.2)  can  be  characterized  by  mean  vectors 
and  correlation  matrices.  These  quantities  are  defined  by 


m  =  £  x 
R  =  E  xxT 


(3.12a) 


(3.12b) 


Note  that  for  stationary  random  processes  the  vector  m  will  consist  of  N  {N 2  M- 
dimensional  subvectors  all  equal  to  the  signal  mean  mx .  The  elements  of  mx  however 
are  not  in  general  all  equal. 

The  correlation  matrix  for  stationary  multichannel  2-D  random  processes  has  a 
specific  structure  and  symmetry  at  its  various  levels  of  partitioning  (Themen,1986). 
These  properties  are  shown  in  Figure  3.3.  Figure  3.4  shows  the  detailed  structure  of  the 
correlation  matrix  R  for  N  j  =  3 jV  2  =  4,  and  M  =  2.  The  scalar  elements  rff  are  given  by 


=  E  xl(nx,n2)Xj(nx-k,n2-l 


(3.13) 


which  represent  the  element  in  the  ilh  row  and  the  }  column  of  the  correlation  function 
R(*  ,/ ).  (Recall  that  R (k ,/ )  is  an  MxM  matrix). 
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Signal  Vector 


*0 

x(n,,0) 

*1 

x(nj,l) 

X  = 

where  Xnj  = 

• 

X  . 

I 

x(mj,V2-1) 

Correlation  Matrix 


R(o) 

R(-l) 


R=E 


R(l) 

R(0) 


.  .  .  R(iV,-l)' 
.  .  .  R(iVj-2) 


R(- iVj -1)  R(-jVj-2)  .  .  .  R(0) 


(BLOCK  TOEPL1TZ  WITH  N2M*  \2M  BLOCKS) 


where 


R(k)  =  E  xnxj_k  = 


R[k.  0)  R(k,l) 

R{k.~l)  R{k,  0) 


.  .  .  R(k,N2- 1) 
.  .  .  R{k.N2-2) 


R{k -y2~l)  R(k,-y2- 2)  .  .  .  R(k.O) 


where 


(BLOCK  TOEPLITZ  WITH  M  -  M  BLOCKS) 


R(kvk,)  RT(  A-,,  k2)  E[x(  rj,,n2)xr(n1  Ar t ,n2 -  Ar2) j 


(.VOr  TOEPLITZ) 


Fig.  3.3  The  correlation  matrix  (figure  taken  from  Therrien.1986). 


•  »# 


>t  *1*1 


Not*:  r  *  r.  * 

>1  1> 

The  matrix  is^  Block  Toeolitz  but  not  block  symmetric,  has  block  Toeolitz 
(but  not  block  symmetric)  blocks.  The  sub-blocks  ore 
symmetric  but  not  Toeolitz  in  general. 


C.  ESTIMATION  IN  THE  FREQUENCY  DOMAIN 

Stationary  random  signals  arc  characterized  in  the  frequency  domain  by  their  power 
spectral  density.  The  power  spectral  density  (or  simply  the  power  spectrum)  matrix  for  a 
multichannel  2-D  random  process  is  defined  by: 


S(co1,co2)=  2*  R(M2)e 


(3.14) 


where  ROfc^/fc^)  is  the  true  autocorrelation  function.  In  case  of  a  two-channel  (M=2)  ran¬ 
dom  process  the  spectral  matrix  is  given  by 


S  n*©!,  CO2)  Sl2(co1,to2)| 


SfO)!,^)  = 


(3.15) 


5  2l(°)l  •ClJ2)  522(t*3l»W2) 


where  Sufco^CDj)  and  S22(a)i’0iz)  represent  the  autospectrum  components  corresponding 
to  the  two  channels  while  S  ^(COj.CD^  and  ^ 2 1  (°°  1  ’ ^^2)  represent  the  cross  spectrum  com¬ 
ponents  (between  channels).  The  autospectrum  components  are  always  real  and  nonne¬ 
gative,  while  the  cross  spectrum  components  are  generally  complex.  For  the  case  of  a 
real  random  process  the  cross  spectrum  components  are  equal  in  magnitude  and  opposite 
in  phase.  Sometimes  the  2-D  magnitude  squared  coherency  (MSC)  is  used  instead  of  the 
magnitude  of  the  cross  spectrum.  This  quantity  (K2)  is  defined  by 


K^COj.COz)  = 


IS  12(01,(02)! 


S 1  i(C0j_,  0>2)5  22(^1 


(3.16) 


D.  MULTICHANNEL  2-D  RANDOM  PROCESS  AND 


LINEAR  PREDICTION 


The  methods  of  2-D  AR  spectrum  estimation  discussed  in  Chapter  II  can  be 
extended  to  the  multichannel  case.  Consider  a  stationary  multichannel  2-D  signal 
x{nx,ni).  The  linear  prediction  problem  is  to  predict  the  current  value  of  the  vector 
x(n  i,n2)  on  the  basis  of  all  "past"  values  of  the  signal  in  a  region  a.  The  estimated  value 


■•wv 


SR 


§11 

H 


takes  the  form 


x(n1>n2)  =  -  £  a£2x(«i-'i.*2-‘2) 

(<',,/i)ea  (3.17) 

(ill«1MO.O) 

where  the  parameters  A,-,^  are  MxM  matrix  coefficients.  Let  us  define 

Aoo  =  I  (3.18) 

where  I  is  the  identity  matrix.  Then  the  remaining  values  of  the  predictive  filter 
coefficients  Aiit-2  (i^Ea)  are  chosen  such  that  the  mean  square  values  of  the  com¬ 
ponents  of  the  prediction  error 


e(n  lt«2)  =  x(«  1^2)  “  *(»  1**2) 

=  2  AM2x^n  i~ii’n2_I2)  (3  19) 

(«i.i2)eo 

are  minimized.  Fig.  3.5  depicts  the  interrelationships.  The  correlation  function  of  the 
error  evaluated  at  lag  (0,0)  is  the  MxM  prediction  error  covariance  matrix  ZE. 


Ie  =  E 


ein^n^e7  (nhn  2) 


=  R£(0,0) 


The  mean-squared  error  is  given  by 


I  e(«i,n  2) 1 


=  E 


(n1,n2)e('ti,n2)]  =frS£ 


(3.20) 


(3.21) 


It  is  known  from  the  orthogonality  principle  (Papoulis,1981)  that  the  prediction  error 
of  the  signal  must  be  orthogonal  to  all  the  past  values  of  the  signal  used  to  estimate  the 
present  value. 


Ejx(n1-i1,/z2-/2)e7’(ni,«2)]  =  [o]  (M^ea,  diJz)*( 0.0) 
It  further  follows  from  Equations  (3.19),  (3.20)  and  (3.22)  that 


(3.22) 


Ze  =  £ 


x(nvn^£r  (nx,n2) 


(3.23) 
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These  last  two  equations  can  thus  be  written  as 


E  x(/t  i~i  1  ,n 2-i 2)e7' (n  i ,« 2)  j  =  £e5(/1,j2)  (M2)**  (3.24) 


Equation  (3.19)  can  be  put  in  a  matrix  form  as  follows 


e(n1,n2)  =  Ar  x„ 


(3.25) 


where  *ni„2  is  an  vector  of  the  ordered  values  x(n1-/1,/i2-J2)  appearing  in  Eqn.  (3.19) 
and  A  is  a  correspondingly  ordered  matrix  with  blocks  equal  to  the  AJjJ2  for  (ij,i2)ea. 
Substituting  into  Equation  (3.24)  we  have 


Xn1n2-£7'(«l.«2)  =^[xn1n2-Xnf1«J-  A 

=  £  x„,n2  •  xJ,n2]  A  =  S 


A  =  S 


R  .  A  =  S 


(3.26) 


(3.27) 


where  R  is  the  correlation  matrix  of  xni„2  and  S  is  matrix  of  size  corresponding  to  the 
matrix  A  with  blocks  defined  as  follows: 


(3.28a) 


H 


s<0)  = 


Z*' 

0 


(3.28b) 


We  will  refer  to  Eqn.  (3.27)  as  the  Normal  equations  for  2-D  multichannel  linear  predic¬ 
tion.  The  error  covariance  matrix  and  the  filter  coefficients  are  obtained  from  the  solu¬ 
tion  of  these  Normal  equations.  Specific  examples  of  these  equations  are  considered  in 
the  next  section. 


E.  LINEAR  PREDICTION  MODELS 

Let  x(n,,n 2)  be  an  arbitrary  zero-mean  random  process  and  let  xin^n^  denote  a 
prediction  estimate  of  the  random  variable  x(n1,n2).  Two  different  linear  prediction 
models  are  considered  here,  namely  the  quadrant  plane  model  (QP)  and  non- symmetric 
half  plane  (NSHP)  model. 

1.  Analysis  of  Quadrant  Plane  Models 

The  estimated  prediction  error  filter  coefficients,  and  the  prediction  error  covari¬ 
ance  can  be  evaluated  for  the  quadrant  plane  model  as  follows,  assuming  the  size  of  the 
region  of  support  a  is  P  jxP2  points.  Eqn.  (3.19)  can  be  written  in  the  form 

P  |-1  Pr- 1 

e(ni/t2)=X  Z  Ah>2X(ni~‘i<n2-‘2)  (3.29) 

1 1=0  I  2=0 

This  model  is  called  a  first  quadrant  model  since  the  impulse  response  of  this  filter  has 
support  in  only  the  first  quadrant  of  the  2-D  plane.  The  signal  data  needed  in  Eqn.  (3.19) 
is  in  the  range  nj-Pj-l  to  n,  and  n  ;-P  2+l  to  n  __  as  shown  in  Fig.  3.6,  where  the  solid 
dots  denote  prediction  error  samples  which  have  been  computed,  and  the  crosses 
represent  samples  that  remain  to  be  computed.  For  this  particular  case  of  a  quadrant 
plane  mode!  the  vector  x„ is  defined  by 
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'  A  M\ 


x(  n  ;  -t  .n  - 1 
i  x(n  .  -i  ,/i  1 ) 

*<.,-.=!  ,=<)i-  p 

i 

i 

x(n  [-1  ,n  2~P  2.*  '  ' 

L  * 

The  prediction  error  is  given  by  Eqn.  ( 3  25  > 
where  A  now  has  the  specific  form 


A  = 


,<0) 


k'W 


.(/»  ;-l  . 


and 


~a 


A'"  = 


Vo  1 


Vi 


* 


The  Normal  equations  for  this  region  of  support  van  be  viewed  as  follows  V^r  have  a  .c 


HJ 


.VV  VVV  V  \'V  '  '  s' 


>1  P  ‘P  points  as  shown  in  Fig  '  7a  and  a  corrclaoon  matnx  R  is  formed  for  the 
P  >P  points  Fhe  terms  of  the  correlation  function  appearing  in  the  matnx  are  shown  in 
hg  i  'b  Ihe  Normal  equations  have  the  general  form  of  Eqn.  (3.27).  The  specific 
structure  in  this  vase  is  as  follows 


R 1 0 1  Roll 

r;  R) 

R(-A,,+ir 

R< 

■A,o,  - 

a'!) 

'  s<of 

0 

Ri F*  i  .  RiA»j-2i 

R(0) 

. 

0 

where 


(3.33a) 


r  Rot  .0)  Rt*.-h 

Rot  .1 1  Ri*.0) 


R  (k-P  2+1)’ 
R(*.-Pi+2) 

’  t.  ' 


(3.33b) 


R(*^2-l)  R(kJ>r2)  RCA  .0) 


S' J'  = 


( 3  3  3c  i 


0 


and  where 


H4 


R(*  i  ,k  2)  =  Rr  <rk  1  -k  2)  =  E  ^ x(n  x  ,n  ^x7  (n  1-/:  1  ,n  2-k  2)] 

>11^1^2)  rn(kx,k£  .  .  .  rlM(kxJc2)- 
r2l(*l>*2)  r  22(^1  .*2)  •  •  •  r  2M  l  2) 


(3.33d) 


rA/l(^l>^2)  rA/2^1^2)  •  •  •  rWAf(^1^2) 


^o.o- 1 


(3.33c) 


Ze  =  E  (x 


K-X)(X-X)T 


(3.330 


"  2  2 
On  <*12 

^21  <*22 


CTW1  aM2  •  ■  • 

As  mentioned  before  R  is  an  MP  XP  2xMP  XP  2  matrix  and  A  and  S  are  each  MP  XP 
matrices.  Note  that  the  matrix  R  defined  in  Eqn.  (3.33)  differs  from  that  in  Fig.  3.3  in 
that  the  first  level  partitions  are  block  transposed.  This  results  from  the  particular  order 
ing  of  the  vector  x„  „  (Eqn.  (3.30))  which  is  used  to  define  the  2-D  multichannel  linear 
prediction  problem. 

The  previous  analysis  was  derived  for  the  case  where  the  filter  coefficients  V 
had  first  quadrant  support.  If  we  set  up  the  problem  assuming  second  quadrant  support 
In  this  case  the  prediction  error  filter  is  given  bv 


P,-\Pr\ 

e(rti/l2)=  £  I  Bi1.l*(#,l+‘l-/,2“*2> 

1 1=0  ij=0 


(3.34) 


where  B,tl2  represent  the  backward  predictive  filter  coefficients.  The  filter  is  actually 
similar  to  that  of  Fig.  3.6  except  it  predicts  the  point  in  the  upper  left  comer  and  we  get  a 
different  set  of  filter  coefficients.  We  can  set  up  a  matrix  equation  similar  to  Eqn  ( 3.27). 
but  now  the  matrix  B  reflects  the  second  quadrant  support 


RB  =  S, 


where  R  is  the  same  matrix  as  (3.27)  and  B  is  defined  bv 


B.a-i] 


B"  = 


I 


(3.35) 


3  3hai 


(336d) 


s  L0)  = 


0 

0 


L 


The  third  and  fourth  quadrant  filters  for  predicting  the  bottom  left  and  bottom 
right  points  are  also  distinct.  Their  Normal  equations  differ  from  the  Normal  equations 
of  the  first  and  second  quadrant  b  ters  in  that  the  innermost  blocks  Rt k  <  jk  -  >  of  the  matrix 
R  are  replaced  by  their  transposes  This  is  a  difference  from  the  single  channel  2  1)  case 
where  the  first  and  third  and  the  second  and  fourth  quadrant  filters  are  dentical 

2.  Analysis  of  Non-Svmmetric  Half  Plane  Model 

As  we  discussed  in  Chapter  II.  the  NSHP  model  is  an  important  form  of  predu 
tor  because  of  the  ability  to  factor  arbitrary  spectra  into  terms  with  torward  and  backward 
NSHP  support  Also  we  mentioned  that  models  with  NSHP  support  ot  modest  size s 
have  been  found  to  give  reasonable  results  in  spectral  estimation 

Now  we  consider  the  problem  of  finding  a  finite  outer  NSHP  model  as  sketched 
in  lug  '  Ha.  given  cosariaru.es  on  a  finite  window  as  shown  m  1  ig  ;  Hb  Dir  predu  non 
equation  tor  a  i  /  ’  ./’  - 1  order  N SUP  model  is 


x  i  n  'i 


'  \  X 


r  r . 

S  '  \  x  ■  n 


1  ,  jll  1  ’  ■  * 


•f  V  ' 


'  \ 
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where 


(3.39) 


'x(nun2- 1) 
x(fl1,n2-2) 


x 


*i 


(3.40a) 


x(n1,n2-/>2+D 


x(n1-i,rt2+/>2-l) 


x|n  ,-i  ,n  O 


VH|-|  ,/l  s~P  ;+l. 


1=1.2. ....  P  .-I 


(3.40b) 


Note  that  the  si/e  <4  the  subsector  x,  is  smaller  than  that  ot  x„  (  tor  Isi  *. /’ .  It  \  is  i 
he  a  minimum  sarian.e  prrtiii. lion  estimate  1  e  mie  that  mimmi/es  mean  square  ermr 
then  ttie  mhoijofi.il  its  pruuiplr  •  u.st  ttoui  a:ui  r.uls  to  the  following  Normai  equation. 


•R'(0)  R'(-l) 

R'(l)  R(0) 

.  R'HVD' 
•  RHV2) 

A*®  ‘ 

0 

R Vpl)  R(/»i-2)  .  . 

•  R(0) 

A(/>1_1) 

0 

-R(*,0)  R(/t,-l)  .  .  .  R(k,-2P2+\y 

R(k,l)  R(*,0)  .  .  .  R(k-P2+2) 
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R(0,-1) 
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R'(O)  = 


(3.42d) 


R(0^2-l)  R(0^2-2)  .  .  .  R(0,0) 

’^0,0 

^0,1 

A<°>*  ' 


(3.42e) 


S^0)  = 


(3.420 


and  where 


If=f  j  (x-\)(\-\)T 

c.  - 


l3.42e) 


and  Rlifc  is  defined  by  Eqri.  (3.33d).  VVe  refer  to  this  as  the  /<>rHdrJ  prediction 


model. 


For  backward  prediction  of'  the  process  x  we  suppose  that  samples 


xtn  <-P  .♦  1  ,n  2~P  2*  1  '•  xtn  .  .n  «-  1  >  are  available  as  shown  in  Fi£  3  10,  and  we 
attempt  a  linear  prcdictu>n  of  \,  n  ..no  according  tv' 


X„  'i  no  V  K  X  n  n  X  X'  B  X'  n 
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The  backward  prediction  error  is  defined  as 


eb(nl,nz)  =  x{n<l,n 2)  ~  (n1,n2) 

(3.44) 

Pr  1  /’.-l  Pi-l 

=  Z  Bg, jXf/z j ,/s 2+i 2)  ■*"  Z  Z  B(i,2x(rt !+/!,« 2+i 2) 

(J=0  <  1=1  '2=-^  2-1 

If  the  backward  prediction  error  filter  coefficients  B,-^  are  chosen  to  minimize  the  mean 
square  error  I  (n  1,n2) 1 2  then  the  corresponding  Normal  equations  are  given  by 
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R'r(-P,+l)l  rB(/*.-i>]  r  0 
R't(-P  i+2)  0 


R,r  (?!-!)  R'r  (P  !~2)  .  .  .  R/7(0) 
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and  the  other  terms  are  as  defined  earlier.  It  is  clear  from  the  previous  analysis  that,  the 
forward  and  backward  models  have  distinctly  different  Normal  equations.  More 
specifically,  if  we  reorder  the  Equations  (3.45)  so  that  they  have  the  same  partitioning  as 
(3.41)  we  find  that  the  difference  between  the  two  sets  of  equations  is  that  the  innermost 
correlation  block  RO^,*^  is  transposed.  The  solution  of  forward  and  backward  Normal 
equations  leads  to  different  model  parameters  (filter  coefficients  and  error  covariances). 
Since  the  model  parameters  are  not  identical,  different  spectral  estimates  for  the  forward 
and  backward  models  are  obtained.  The  forward  and  backward  models  are  very  impor¬ 
tant  and  will  be  used  in  later  sections. 


F.  AUTOREGRESSIVE  MODEL  FITTING 

If  x(n  x,n2)  is  a  zero  mean  stationary  multichannel  2-D  random  process,  then  a  causal 
representation  of  the  type 

x(n1n2)  =  -  £  x(rt1-i1,/t2-t‘2)  +  w(ni,/t2) 

OV^ea  (3-47) 

Ot.iiMO.0) 

is  called  an  autoregressive  (AR)  representation.  The  sequence  v/{nx,n^  is  a  zero  mean 
white  noise  random  process  independent  of  the  past  outputs  with  covariance 

Z*  =E^w(n1,/i2)  ^(n^/tj)]  (3.48) 

If  x(n  i ,n2)  is  an  AR  process  defined  by  (3.47)  then  the  quantity 

x(n1n2)  =  -  £  A^x^-ij^-ij) 

(i,./2)EO  (3.4V) 

(«1,iJM0.0) 

is  the  best  mean- square  predictor  of  x(nltn2)  based  on  all  of  its  past.  Thus  equation 
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(3.47)  becomes 


x(rti,/i2)  =  x(ni,rt2)  +  w(ni,rt2)  (3.50) 

which  says  the  sample  at  (n^n^)  is  the  sum  of  its  minimum  variance  causal  prediction 
estimate  plus  the  prediction  error  which  is  white  noise.  This  is  also  called  the  innova¬ 
tions  representation. 

Now  define  the  polynomial 

H(zltZ2)=  2  Af,,*!-1  z2-"1  (3.51) 

n1/i2eo 

Then  it  can  be  seen  from  Eqn.  (3.47)  that  the  transfer  function  of  an  AR  representation  is 
[h(z^2)]  1  which  is  an  all  pole  model.  Figure  3.11  illustrates  the  relation  between 

linear  prediction  and  autoregressive  modeling.  While  linear  prediction  uses  an  FIR  filter, 
AR  modeling  employs  a  recursive  or  IIR  filter. 

In  general  when  linear  prediction  is  applied  to  arbitrary  2-D  random  process,  a  white 
noise  error  signal  does  not  result.  Nevertheless,  for  sufficiently  large  regions  a  the  resi¬ 
dual  may  be  found  to  be  close  to  a  white  noise  process  in  practice.  This  is  true  especially 
when  the  model  has  NS  HP  support  For  a  model  with  NSHP  support  the  AR  model 
becomes  a  close  approximation  to  any  given  random  process  if  the  order  becomes 
sufficiently  large.  This  argument  follows  from  results  in  Refs.  (Ekstrom  and 
Woods,1976)  and  (Marzetta,1978)  that  show  that  arbitrary  2-D  power  spectra  can  be  fac¬ 
tored  into  forward  and  backward  models  with  (generally  infinite)  NSHP  support  Thus  it 
is  reasonable  to  seek  to  represent  2-D  random  processes  by  suitable  AR  models  and  to 
derive  the  parameters  by  formulating  and  solving  Normal  equations. 

G.  MULTICHANNEL  2-D  BURG  TECHNIQUE 

There  has  been  in  the  past  decade  strong  interest  and  much  activity  in  developing 
high  resolution  power  spectrum  estimation  techniques,  particularly  for  short  data  records. 
The  Burg  maximum  entropy  method  (MEM)  and  the  associated  algorithms  for  estimating 
the  AR  model  parameters  directly  from  the  data  have  especially  received  much  attention 
in  this  regard. 


x(ni,n2) 


€(ni,n2) 


(a)  Linear  prediction 


w(ni,n2) 


x(n1-n2) 


(b)  AR  modeling 


Fig.  3.11  Linear  prediction  and  AR  modeling. 
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A  2-D  multichannel  Burg  algorithm  can  be  developed  as  a  straightforward  combina¬ 
tion  of  a  2-D  Burg  algorithm  (see  Chapter  II ),  and  a  1-D  multichannel  Burg  algorithm 
(details  are  given  in  Appendix  D). 

As  in  the  case  of  the  single  channel  2-D  Burg  algorithm,  this  method  does  not  result 
in  spectral  estimates  (perhaps  because  procedure  does  not  in  fact  solve  the  Normal  equa¬ 
tions  and  the  low  order  terms  in  the  recursion  do  not  result  in  reasonable  approximations 
to  whitening  filters).  Fortunately,  it  is  possible  to  generate  another  procedure  that  esti¬ 
mates  the  filter  parameters  directly  from  the  data  and  does  result  in  good  spectral  esti¬ 
mates.  This  procedure  is  discussed  in  the  next  section. 

H.  DIRECT  PARAMETER  ESTIMATION  TECHNIQUE 

1.  Multidimensional  and  Multichannel  Relations 

As  we  mentioned  in  Chapter  II  there  is  an  exact  relation  between  single  channel 
2-D  linear  prediction  and  multichannel  1-D  linear  prediction  (Therrien,1981).  These 
kinds  of  relations  also  exist  for  multidimensional  linear  prediction  problems  of  higher 
orders. 

In  this  thesis  we  will  specifically  take  advantage  of  the  relations  between  mul¬ 
tichannel  2-D  linear  prediction  problems  and  higher  order  1-D  multichannel  linear  pred¬ 
iction  problems.  The  results  will  be  used  to  estimate  the  model  parameters  of  a  mul¬ 
tichannel  2-D  random  process  directly  from  the  data  without  prior  estimation  of  the  auto¬ 
correlation  function. 

A  multichannel  linear  prediction  problem  with  first  quadrant  support  of  order 
(P1XP2)  and  widi  three  channels  is  shown  in  Fig.  3.12a.  In  this  case  the  points  sur¬ 
rounded  by  the  cylinder  are  predicted  from  all  the  other  points  appearing  in  the  three 
planes.  The  Normal  equations  are  given  generally  by  Eqn.  (3.27)  and  specifically  for  this 
case  by  Eqns.  (3.32)  and  (3.33).  The  corresponding  multichannel  1-D  linear  prediction 
problem  is  shown  in  Fig.  (3.12b).  In  this  figure  the  same  data  of  Fig.  3.12a  are  used  but 
these  data  are  now  considered  to  be  a  collection  of  discrete  1-D  signals  evolving  along 
the  direction.  The  total  number  of  these  signals  is  A/F2  and  they  form  an 
M/^-dlmensional  multichannel  random  process.  In  Fig.  3.12b  the  single  value  of  ri\  at 
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which  the  MP  2-channels  of  the  process  are  to  be  predicted  (from  the  signals  at  earlier 
values  of  n  j)  is  represented  by  all  the  points  that  appear  in  the  solid-line  box.  Assume  x„ 
is  a  vector  of  the  data  points  to  be  predicted 


x(n  ,0) 
x(n,l) 


*n  = 


(3.52) 


x(nj>  2-l) 

In  this  case  the  /^-channel  1-D  linear  prediction  problem  takes  the  form: 


Pi-lr  'i  T 

=  -  £  [“'  ]  *»-■ 
i=  1  ^ 


(3.53) 


where  x„  is  the  estimated  value  of  the  PjM  -dimensional  data  vector  x„  and  a(i)  is  the 
linear  predictive  filter  coefficients.  The  corresponding  Normal  equations  can  be  written 
as  follows: 


R(0) 

R(l) 


R(-l) 

R(0) 


R(-/,i+2) 


R(P,-1)  R^j-2)  .  .  .  R(0) 


I 

'v 

aO) 

0 

• 

0 

(3.54) 


where 


101 


n 


.V 

.V 

.V 


!i 


a 

r 


U 

.v 


•  I 


.T 

; 

I 

I 

■A 


(3.55) 


'<W  .  .  .  a$rl 

aio5  On  •  •  •  «i^i 

a(i>=  ' 

Oprl.0  «,&u  •  •  •  a£-i j>ri 

and  where  each  block  a $  is  of  size  MxM  and  EPi  is  the  P-^'xPjM  prediction  error 
covariance. 

Now  note  that  since  the  correlation  matrix  appearing  on  the  left  side  of  (3.54)  is 
the  same  as  that  in  (3.33a)  we  can  multiply  both  sides  of  (3.54)  by  the  term  A(0)  and 
compare  the  result  to  (3.33a).  Then  (3.54)  will  be  identical  to  (3.33a)  if  we  require 

EPi  A(0)  =  S(0)  (3.56) 

and 

A(0  =  a(,)  A(0)  /=0,1, ....  (3.57) 

The  foregoing  analysis  shows  that  the  multichannel  2-D  Normal  equations  can 
be  solved  by  the  following  steps: 

( 1 )  Solve  the  /* -dimensional  1-D  multichannel  problem  (3.54)  using  the  1-D  mul¬ 
tichannel  form  of  the  Levinson  recursion.  This  finds  the  coefficient  matrices  a.in) 
and  the  prediction  error  covariance  matrix  E^. 

(  2  )  Solve  (3.56)  for  A(0).  Since  EPi  is  not  in  general  Block  Toeplitz,  no  particularly 
fast  algorithm  is  possible. 

(  3  )  Find  the  multichannel  2-D  coefficients  from  (3.57). 

This  approach  to  solving  the  multichannel  2-D  Normal  equations  has  a  special 
advantage  for  spectral  estimation  using  the  combined  quadrant  method.  Specifically,  the 
multichannel  form  of  the  Levinson  recursion  leads  to  find  both  the  forward  and  back¬ 
ward  1-D  filter  parameters.  The  backward  parameters  relate  to  the  second  quadrant  mul¬ 
tichannel  2-D  filter  parameters  in  the  same  way  that  the  forward  parameters  relate  to  the 


first  quadrant  multichannel  2-D  filter  parameters.  If  and  E/>1  represent  the  backward 

parameters  and  B(,)  and  S^0)  represent  the  second  quadrant  parameters,  the  relations 
analogous  to  Eqns.  (3.56)  and  (3.57)  are 


and 


E'Pl  B(0)  =  Si0) 


(3.58) 


B(i)  =  p(')  B(0) ;  i=0,l,  ...,/>1-l  (3.59) 

Both  the  A(,)  and  the  B(l)  parameters  are  used  in  the  CQ  spectral  estimation  procedure 
and  both  are  found  in  a  single  estimation  algorithm. 


2.  Application  to  Quadrant  Plane  Models 

The  algorithm  for  solving  the  2-D  Normal  equadon  discussed  above  can  be 
further  enhanced  by  replacing  step  (1)  involving  solution  of  the  Normal  equations  by  a 
procedure  that  estimates  the  1-D  filter  parameters  directly  from  the  data.  This  method 
will  be  called  a  direct  method  since  it  does  not  require  prior  estimation  of  the  correlation 
function. 

This  method  capitalizes  on  the  existence  of  a  direct  method  for  solving  the  1-D 
multichannel  linear  prediction  problem  known  as  the  multichannel  Burg  algorithm  or  the 
Nuttall-Strand  algorithm.  The  method  is  based  on  ideas  originally  suggested  by  Burg 
(1975)  but  developed  separately  by  Nuttall  (1976)  and  Strand  (1977). 

In  applying  the  multichannel  Burg  algorithm  to  2-D  data,  the  data  are  first  parti¬ 
tioned  into  strips  along  the  n2  direction  as  shown  in  Fig.  3.13(a).  The  strips  of  width  P2 
for  an  7>iX/>2  quadrant  filter  are  catenated  along  the  nx  direction  as  shown  in  Fig. 
3.13(b).  As  in  the  previous  section  this  data  is  considered  to  be  a  1-D  process  (in  the  nx 
direction)  with  P channels.  Discontinuities  where  the  strips  are  catenated  together 
can  be  ignored  since  they  represent  only  a  small  portion  of  the  data.  The  multichannel 
Burg  algorithm  is  then  used  to  estimate  forward  and  backward  linear  prediction  parame¬ 
ters  a(I),  a(2)’  . . . ,  and  E/>1  and  |3(1),  P(2)l  . . . ,  and  E'Pi.  The  error 


covariance  matrices  are  used  to  form  (3.56)  and  (3.58).  These  equations  are  solved  by 
conventional  methods  and  the  multichannel  2-D  first  and  second  quadrant  filter 
coefficients  are  computed  from  (3.57)  and  (3.59). 

L  A  MEASURE  FOR  COMPARING  RANDOM  PROCESSES 

In  the  next  chapter  it  will  be  important  to  have  a  quantitative  measure  for  comparing 
the  similarity  of  two  spectral  estimates.  Such  a  measure  was  cited  in  Chapter  II  and  is 
based  on  the  information-theoretic  divergence.  The  divergence  measures  the  closeness  of 
the  distributions  for  two  random  processes.  Since  we  model  the  spectrum  as  that  of  a 
linear  filter  driven  by  white  noise,  it  is  reasonable  to  measure  the  closeness  of  two  spec¬ 
tral  estimates  as  the  divergence  between  random  processes  produced  by  the  models.  As 
discussed  earlier,  the  divergence  relates  to  the  ability  to  discriminate  between  the  two 
random  processes  in  a  classical  hypothesis  test.  When  the  white  noise  which  drives  the 
linear  filters  is  Gaussian  one  can  derive  an  explicit  closed-form  expression  for  the  diver¬ 
gence.  This  can  be  expressed  in  terms  of  the  model  parameters  or  in  terms  of  the  associ¬ 
ated  spectral  estimates. 

The  divergence  is  a  non-decreasing  function  of  the  number  of  samples  of  the  random 
process  used  in  the  hypothesis  test  For  linear  white  noise -driven  models  each  new  sam¬ 
ple  contributes  a  fixed  additive  amount  to  the  divergence.  This  quantity  has  been  called 
the  incremental  divergence  and  represents  the  information  available  for  discrimination 
of  two  random  processes  with  each  new  sample  that  is  observed. 

The  representation  of  incremental  divergence  AJ  (1,2)  between  two  multichannel  2- 
D  random  processes  can  be  written  in  terms  of  filter  parameters  or  in  terms  of  spectral 
estimates  as  follows. 


1.  In  Terms  of  Filter  Parameters 


tr  A2RiA2tS2  1  +  tr  A^A^Zj  (3.60) 

where  M  represents  the  number  of  channels.  A*  (k=l,2)  represents  the  set  of  filter 
coefficients  of  the  process  k,  R*  is  the  entire  covariance  matrix  of  the  process  k,  and 
finally  Z*  is  the  error  covariance  matrix  of  process  k. 

2.  In  Terms  of  Spectral  Estimates 


A/(U)  =  -M  + 


A/(i,2)  =  -M  +  ~r[\  f  tr  S1(co1,co2)S:£' ‘(CDj.co^dco^G^ 

8tr  -Vti 
ji  it 

+  j  jtr  S^cOj.co^Sf^co^co^dw^coJ  (3.61) 

-71-71 

where  Sj  and  S2  are  the  MxM  spectral  matrices  of  the  modeled  random  processes.  The 
above  expressions  for  the  incremental  divergence  for  multichannel  2-D  random  processes 
are  derived  in  Appendix  E.  The  expression  given  in  terms  of  spectral  estimates  (3.61)  is 
our  main  interest  in  the  present  research. 

Although  the  form  (3  61)  for  the  incremental  divergence  is  based  on  the 
existence  of  linear  models  for  the  spectra  and  S2  the  expression  (3.61)  can  be  simply 
defined  as  a  measure  for  comparing  any  two  spectral  estimates.  When  one  of  the  spectra 
(say  S2)  is  defined  by  the  CQ  method,  the  term  in  Eqn.  (3.61)  involving  the  inverse  is 
additive.  Unfortunately  the  other  term  is  not,  so  the  incremental  divergence  appears  to 
defy  any  further  interpretation. 
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IV.  MULTICHANNEL  2-D  AR  SPECTRUM  ANALYSIS 

This  chapter  is  concerned  with  multichannel  2-D  spectral  estimation  based  on  AR 
models.  Here  the  problem  is  to  obtain  accurate  spectral  estimates  given  a  finite  number 
of  data  samples.  We  begin  by  summarizing  the  spectral  estimation  models,  and  show 
that  since  the  NSHP  forward  and  backward  models  have  distincdy  different  model 
parameters,  in  general  this  leads  to  different  spectral  estimates.  Similarly  it  is  shown  that 
in  general  none  of  the  four  possible  multichannel  2-D  QP  models  gives  the  same  estimate 
for  the  spectral  matrix.  The  latter  suggests  a  generalization  of  the  procedure  used  by 
Jackson  and  Chien  for  combining  multiple  quadrant-based  spectral  estimates  into  a  single 
combined  spectral  estimate.  We  call  this  procedure  the  combined  quadrant  (CQ) 
method,  and  it  can  be  used  effectively  with  the  direct  method  of  parameter  estimation 
developed  in  Chapter  in.  A  comprehensive  set  of  experimental  studies  for  estimating  the 
entire  spectral  matrix  is  presented.  The  estimate  includes  the  2-D  autospectrum  for  each 
channel  and  magnitude  and  phase  of  the  cross  spectra.  The  estimates  compared 
correspond  to  NSHP,  CQ,  and  direct  CQ  models. 

A.  SPECTRUM  ESTIMATION  MODELS 

Parametric  approaches  to  spectral  estimation  are  based  on  an  a  priori  model  for  the 
spectrum  in  terms  of  a  finite  and  relatively  small  number  of  unknown  parameters.  The 
spectral  density  estimate  can  then  be  obtained  by  evaluating  a  formula  in  terms  of  the 
estimated  parameters.  When  a  good  parametric  model  for  the  spectrum  is  available,  esti¬ 
mates  of  the  model  parameters  can  be  obtained  using  a  number  of  different  approaches. 

Autoregressive  modeling  is  the  form  of  spectrum  estimation  that  will  be  examined 
here.  Filter  parameters  and  the  white  noise  covariance  are  estimated  by  the  techniques 
described  in  Chapter  III.  Let  the  linear  predictive  filter  transfer  function  be  given  by 

H(zi,z2)=  I  Af/jzf/lz2-,J  (41) 

(h./2)ea  v  ' 


Then  the  spectrum  of  the  random  process  can  be  expressed  as 


(4.2) 


SC©!, £02)  =  H~ ^COj.COi)  Sw  [  Hh  (©i.o^j 

where  L*,  represents  the  white  noise  spectral  matrix.  For  the  specific  case  when  the 
number  of  channels  (M)  is  equal  to  2  the  filter  has  the  form 


tfllCCOj.COi)  //12(©i,C02) 

^2l(“l.<02)  H  22(“l.£lJ2) 


(4.3) 


and  the  spectral  matrix  has  the  form  given  by  Eqn.  (3.15).  Thus  Eqn.  (4.2)  produces  esti¬ 
mates  of  the  autospectra  and  the  cross  spectra  all  at  once. 

For  purposes  of  developing  a  spectral  estimate,  different  regions  of  support  of  the 
AR  model  can  be  used.  We  are  interested  here  in  the  NSHP  and  QP  regions  of  support. 


1.  Non-Svmmetric  Half  Plane  Models 

As  we  mentioned  in  Chapter  HI,  NSHP  models  of  moderate  size  have  proven  to 
give  reasonable  results  for  spectrum  estimation.  The  spectrum  can  be  expressed  by 
Equation  (4.2)  where  H(z1,z2)  is  the  filter  transfer  function  for  the  given  region  of 
support 

Hd  „Z2)  =  I  +  X  22  1  +  XX  Aft,  z  f"  z?'  (4.4) 

1*2=1  il=ll2=>”/>2 

and  where  Aitii  represents  the  filter  coefficient  matrices  of  the  forward  predictor.  As  we 

will  see  in  section  B,  the  forward  and  backward  predictors  discussed  in  Chapter  IH  give 
different  spectral  estimates.  In  subsequent  sections  of  this  chapter  we  examine  results  of 
NSHP  spectral  models  of  various  orders.  Fig.  4.1  shows  the  definition  of  order  used  for 
NSHP  models. 


2.  Quadrant  Models 

Quadrant  support  regions  are  also  considered  for  the  AR  model.  These  are  of 
interest  because  of  their  convenience  of  computation  and  estimation  of  the  model  param¬ 
eters.  As  discussed  in  Chapter  II  there  exist  four  different  quadrant  plane  support 
models.  However  we  will  find  in  the  multichannel  2-D  case  that  none  of  the  models  is 
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Definition  of  order  for  NSHP  models 


the  same.  The  first  quadrant  filter  transfer  function  of  order  (P  2)  is 


P  P 

H,«0I,0)2)=  £  .  A«,=I  (4.5) 

ft  1^)  /i2=0 

where  A ni„2  represents  the  first  quadrant  filter  coefficients.  The  corresponding  power 
spectral  estimate  evaluated  from  H/  (co^co^)  is: 

S/(co1,£02)  =  H/-1(031,co2)£w,  [h^cOlCO^J  1  (4.6) 

where  'LW/  is  the  error  covariance  matrix  of  the  first  quadrant  model.  Similarly  the  spec¬ 
tral  estimate  of  the  second  quadrant  model  is  given  as 

S//(co1,£o2)  =  H/71(o)1,co2)2^„  [h#((d1,<d2)]  1  (4.7) 

where 

P  P 

H„(0)„0)2)=  £  s  ,  Boo=I  (4.8) 

«i=0  «i=0 

and  where  Bn  „2  are  the  second  quadrant  filter  coefficients  and  ’LWn  represents  the  error 
covariance  matrix  of  the  second  quadrant  model.  The  third  and  fourth  quadrant  filters  are 
given  by  similar  expressions  and  are  also  distinct  as  we  will  see  in  the  next  section. 

It  will  be  seen  that  first,  second,  third,  and  fourth  quadrant  AR  models  when 
used  individually  give  poor  estimates  of  the  spectrum  of  many  random  processes.  For 
the  analysis  of  sinusoids  in  noise  the  use  of  any  quadrant  filter  alone  results  in  a  spread¬ 
ing  of  the  peak  in  one  direction.  However  a  certain  combination  of  the  models  has  pro¬ 
ven  to  give  good  results  for  estimation  of  spectra  in  2-D  single  channel  problems  (Jack- 
son  &  Chien,  1979).  In  section  B  we  propose  a  generalization  of  this  method  to  2-D  mul¬ 
tichannel  spectrum  estimation.  The  definition  of  order  used  for  QP  models  is  shown  in 


The  forward  and  backward  AR  models  for  1-D  or  2-D  single  channel  signals  have 
the  same  power  spectra.  That  is,  the  estimates  produced  by  these  models  are  always 
identical.  The  reason  for  this  is  as  follows:  The  correlation  functions  in  the  single  chan¬ 
nel  case  have  the  symmetry  properties  R(k)  =  R(-k)  for  1-D  and  R  (k  =  R  {-k x ,-k2) 

for  2-D.  Because  of  this  the  correlation  matrices  appearing  in  the  Normal  equations  for 
the  forward  and  backward  models  are  identical  and  thus  lead  to  identical  model  parame¬ 
ters  (i.e.  filter  coefficients  and  error  covariance).  This  in  turn  leads  to  identical  spectral 
estimates. 


In  the  1-D  multichannel  case  the  correlation  matrices  appearing  in  the  Normal  equa¬ 
tions  for  the  forward  and  backward  models  are  not  identical  because  the  (matrix)  correla¬ 
tion  function  satisfies  only  the  symmetry  condition  R(k)=RT  (-k).  Specifically  the 
matrices  in  the  Normal  equations  differ  in  that  the  innermost  blocks  are  transposed. 
Likewise  for  the  2-D  multichannel  case  the  correlation  function  satisfies  only 
R(k  iJcj)  =  R  {-kx-kf)  and  the  Normal  equations  have  innermost  blocks  transposed. 
The  solution  of  these  Normal  equations  gives  different  forward  and  backward  model 
parameters.  However  Nuttall  (1976)  shows  that  the  forward  and  backward  1-D  mul¬ 
tichannel  power  spectral  estimates  are  identical.  The  proof  is  based  on  the  existence  of 
an  underlying  maximum  entropy  random  process  that  matches  the  given  correlation  func¬ 
tion  and  extends  it.  The  common  spectral  estimate  computed  by  both  the  forward  and  the 
backward  models  is  the  true  spectrum  of  the  underlying  maximum  entropy  process  (see 
(Nuttall,  1976)  for  details). 

In  the  multichannel  2-D  case  it  is  not  in  general  possible  to  match  and  extend  the 
correlation  function.  Thus,  except  in  the  asymptotic  case  (i.e.  as  the  support  tends  to 
infinity)  there  is  no  underlying  process  that  the  two  models  have  in  common.  In  general 
this  leads  to  different  spectral  estimates  for  the  forward  and  backward  models  (see  Fig. 
4.3)  and  these  estimates  may  not  even  be  positive  definite.  This  is  a  significant  differ¬ 
ence  between  multichannel  2-D  AR  models  and  lower-dimensional  models.  Thus  the 
difference  between  forward  and  backward  2-D  multichannel  spectral  estimates  is  seen  to 
be  due  to  the  fact  that  the  correlation  function  in  2-D  may  not  be  extendible 


(Dickinson,  1980).  In  essence  there  may  be  no  underlying  random  process  that  would  be 
whitened  by  the  given  linear  predictive  filter. 

To  see  specifically  where  the  system  breaks  down  let  us  for  the  moment  assume  that 
the  linear  predictive  filter  truly  whitens  the  random  process  and  thus  corresponds  to  the 
AR  model  of  the  form 

w(n1,n2)=  £  Aj,x(i xx-ix,n2-i^  (49) 

It  follows  that 

£[w(n1,n2)wr(n1-*1,n2-*J  =  £  X  A^2R(*  i+j  Hi.*  2+7  2“*  2) A,  Ul 


(4.10) 


Taking  the  z  transform  of  (4.10)  and  making  the  substitution  of  variables  ln  =kn  +  jn  -in 
results  in 


X  A^.zfV  I  R(/„/j)*W  I  (4.11) 

Ouijea  /i=— ,/*=■—  0'iJ2)a* 


,  -/u  -h 


Finally  letting  zn=ejca'  and  using  the  definition  (4.1)  leads  to  (4.2)  where  S(q)1,cd2)  is 
seen  to  be  the  Fourier  transform  of  the  correlation  function: 

S(co1,ct)2)  =  (X°)  (4.12) 


It  is  clear  from  this  analysis  that  if  x(«j,n2)  also  satisfies  a  backward  form  of  AR 
model  then  (4.9)-(4.12)  is  also  be  obtained  except  that  the  A,i(  j  and  1^,  are  replaced  by 

the  backward  model  parameters  and  R(/ 1./2)  is  replaced  by  RT(/1,/2).  Further,  since 


Rr(/1,/2)  =  R(-/„-/2) 


(4.13) 


and  the  sum  in  (4.12)  is  over  all  values  of  /,  and  l2,  the  spectral  matrix  defined  by  (4.12) 
for  the  backward  problem  is  identical  to  that  for  the  forward  problem.  Thus  we  see  that 
when  an  AR  model  in  the  form  of  Eqn.  (4.9)  truly  exists  for  the  random  process,  then 
both  forward  and  backward  spectral  estimates  will  be  identical.  The  procedure  breaks 


down  if  we  must  use  a  colored  noise  to  obtain  a  model  m  the  form  of  Eqn.  (4.9).  In  this 
case  we  cannot  write  Eqn.  (4.1 1)  and  the  spectral  estimates  are  not  identical. 

The  spectral  matrix  corresponding  to  the  forward  linear  prediction  StCDi.o^)  and 
backward  linear  prediction  can  be  written  in  the  following  forms,  respectively 


S(£01,Ci>2)  =  H  (H^CO,,^))  1 

S/(£01,co2)  =  H'-1(o)1,0)2)rw  (H/|f(®1,CD2)r1 

where  H,  H'  represent  the  forward  and  backward  matrix  frequency  response 


(4.14) 


(4.15) 


H(ff  ,0)2)  =  £  A„inje 

(niji&a 


-./(coi/M+toyjz) 


H'(®1,CD2)  =  £  B«W 

(»i^«i)£a 


(4.16) 


(4.17) 


and  are  the  forward  and  backward  covariance  error  matrices,  respectively. 

In  the  limit,  as  a  approaches  a  NSHP  with  infinite  support,  the  linear  predictive 
model  and  the  AR  model  become  identical.  This  is  implicit  in  the  work  of  Ekstrom  and 
Woods  (1976),  Marzetta  (1978),  and  Jain  (1981).  In  this  case  the  spectrum  of  the  process 
is  truly  given  by  (4.14)  and  the  spectral  matrix  can  be  represented  in  terms  of  either  the 
forward  or  the  backward  model  parameters.  While  one  cannot  use  infinite  NSHP  sup¬ 
port  in  practical  applications,  larger  orders  can  approximate  the  limiting  case  and  the  two 
spectral  estimates  have  been  found  to  give  very  close  results. 

It  is  of  interest  to  consider  a  numerical  example  to  study  the  behaviour  of  forward 
and  backward  spectral  estimation.  This  example  is  concerned  with  the  power  spectral 
estimation  of  a  two-channel  2-D  single  sinusoid  in  additive  noise.  The  signals  generated 
in  channels  1  and  2  were 


x  i  (n  1,n2)  =  cos(n  jCDj+n  2^2)  +  w  j(n  ltn2) 
x  2(^1^  2)  =  cos(nlayi+n20)4+^)  +  w2(nl,n2) 


(4.18a) 


(4.18b) 


where  w^nj,/^)  and  w2(nl,n2)  are  zero-mean  independent  white  noise  signals. 


Spectrum  estimation  results  are  given  for  a  dataset  size  of  64x64.  The  forward  and  back¬ 
ward  matrix  parameters  of  the  AR  models  are  shown  in  Figs.  4.4  and  4.5  respectively.  It 
is  clear  that  they  are  not  the  same.  Table  4.1  shows  the  values  of  the  spectral  estimates  at 
co1=(02=0-  Observe  that  as  the  model  order  increases  the  value  of  the  individual  terms  in 
the  forward  and  backward  spectral  matrices  become  closer.  The  last  column  in  Table  4. 1 
gives  the  value  of  the  incremental  divergence  for  the  two  spectral  estimates.  This  quan¬ 
tity,  which  is  defined  in  Chapter  in,  measures  the  closeness  of  the  spectral  matrices  over 
all  frequencies.  Since  lower  numbers  represent  closer  spectra,  the  results  indicate  that 
the  forward  and  backward  spectral  estimates  tend  to  converge  as  the  model  order 
increases.  A  summary  of  forward  and  backward  prediction  model  characteristics  for  dif¬ 
ferent  random  processes  is  given  in  Table  4.2. 

1.  Combined  Quadrant  Models 

From  considers  dons  similar  to  those  above,  it  can  be  seen  that  in  general  none 
of  the  four  possible  multichannel  2-D  models  with  quadrant  support  (see  Fig.  4.6)  give 
the  same  estimate  for  the  spectral  matrix.  This  again  is  different  from  the  single  channel 
case  where  for  example  the  first  and  third  quadrant  models,  being  degenerate  cases  of 
forward  and  backward  NSHP  support,  produce  identical  spectral  estimates.  This  suggests 
a  generalization  of  the  procedure  used  by  Jackson  and  Chien  (1979)  for  single  channel 
2-D  spectrum  estimation.  In  particular,  a  combined  estimate  for  the  spectral  matrix  can 
be  formed  as 

SICO!,^)  =  (SfkoJi.C^)  +  S^l(C01,0)2)  +  S^/HcOj.COj)  +  S^COj.G^r1  (4.19) 

where  S/t  Sn,  S /// ,  and  S/y  are  the  spectral  matrices  computed  from  (4.14)  using  the  four 
possible  quadrant  models.  This  procedure  will  be  called  the  combined  quadrant  (CQ) 
method.  Although  it  appears  that  four  quadrant  models  would  be  required  in  general,  it 
was  found  experimentally  that  the  combination  of  only  first  and  second  quadrants  is 
sufficient  to  obtain  good  results. 

S(co1,o)2)  =  (S^1(co1,co2)  +  S^coj,^))-1  (4.20) 


This  form  is  analogous  to  the  form  proposed  by  Jackson  and  Chien  for  the  single  channel 
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Fig.  4.4  Forward  model  parameters  of  first  order  NSHP  model. 
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(a)  Backward  error  covariance  matrix 


(b)  Backward  filter  coefficients 


Fig.  4.5  Backward  NSHP  model  parameters  of  first  order. 
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TABLE  4.1  COMPARISON  BETWEEN  FORWARD  AND  BACKWARD 


SPECTRAL  ESTIMATES 


Order 

Forward  Spectral  Estimate  Backward  Spectral  Estimate 

Incremental  Divergence 

SF  (0,0) 

SB(  0,0) 

AJ 

'0.1754 

0.0325 

'0.1241 

0.0257 

1 

0.0325 

0.1575 

0.0257 

0.4518 

0.3739 

'0.1400 

0.0359 

'0.1069 

0.0030 

2 

0.0359 

0.1721 

0.0030 

0.2068 

0.0914 

'0.0812 

0.0192 

'0.0893 

0.0128 

3 

0.0192 

0.0836 

0.0128 

0.0732 

0.0769 

4 


0.0711  0.019* 


0.0744  0.020: 


0.0703 


Second  QP 


First  QP 


Third  QP 


Fourth  QP 


case.  From  Eqns.  (4.20)  and  (4.2)  one  can  therefore  write 

S(co1,co2)  =  [ H/^cOj.cDj)  H^coj.a^)  +  H^co^c^^H//^,^)]  1  (4.21) 

..  - 

When  we  are  interested  only  in  the  detection  of  sharp  peaks  in  the  spectrum  and  not  in 
the  true  power  level  the  following  simpler  form  can  be  used 


S(co1,0)2)  =  H/^co^co^H;  (coj.o)^  +  (colCi^)  (4.22) 

»  - 

(where  we  have  set  Z*  =  'LWu  =  I)  and  appears  to  produce  satisfactory  results. 

In  order  to  check  the  performance  of  the  four  quadrant  plane  power  spectral  esti¬ 
mate,  a  numerical  example  is  considered.  This  example  is  concerned  with  the  analysis  of 
spectra  for  two-channel  2-D  sinusoidal  signals  in  additive  noise.  The  signals  generated 
in  channels  1  and  2  were: 


L 

x  i(n  1^2)=Scos(n  1+n2a)i2)+w1(n  1,n2)  (4.23a) 

i=i 

L 

x2(nl,n2)='£ cos (n  x C0t 3+n 2®i 4+  0.  )+vv 2(*  l >n 2)  (4.23b) 

i=l 


where  and  w2(n1,n2)  are  zero  mean  independent  white  noise  signals,  and  L 


represents  the  number  of  sinusoids.  We  assume  L=2,  with  0)11=0)12=-^-)  to21=0)22=— , 

2  3 


(0i3c=C0i4=-^-,  co23=co24=Yi  <t>i=1.5  radian,  and  02=O.5  radian.  Spectrum  estimates  are 

given  for  a  dataset  size  of  (64x64)  and  second  order  filter.  The  cross  spectrum  amplitude 
for  first,  second,  third,  and  fourth  quadrants  is  shown  in  Fig.  4.7,  while  the  amplitude  and 
phase  of  the  cross  spectrum  of  the  combined  quadrant  estimate  (Eqn.  (4.20))  is  shown  in 
Fig.  4.8.  The  cross  spectrum  amplitude  and  phase  of  the  total  4-combined  quadrant  esti¬ 
mate  (Eqn.  (4.19))  is  shown  in  Figure  4.9.  Table  4.3  shows  values  of  the  incremental 
divergence  between  the  different  quadrant  plane  spectral  estimates.  Here  CQ4  represents 
the  four-quadrant  CQ  estimate  (Eqn.  (4.19)),  and  CQ2  represents  the  simple  two- 
quadrant  CQ  estimate  (Eqn.  (4.20)).  The  results  indicate  a  closeness  between  the  first 
and  third  quadrant  estimates,  second  and  fourth  quadrant  estimates,  and  the 
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First  QP  (b)  Second  QP 


estimation  of  two  sinusoids  in  noise  (contour  plot) 


EsaasosaBssas 


TABLE  4.3  INCREMENTAL  DIVERGENCE  OF  QUADRANT-BASED  MODELS 


(a)  Comparison  between  CQ4  (Eqn.  4.19)  and  quadrant-based  models. 


Comparison  between  CQA  & 

Incremental  divergence  (AJ) 

cq2 

0.02 

1"  QP 

0.2 

2*  QP 

0.2 

3 M  QP 

0.2 

4*  QP 

0.26 

(b)  Comparison  between  CQ2  (Eqn.  4.20)  and  quadrant- based  models. 


(c)  Comarison  between  quadrant-based  models. 


Comparison  between 

Incremental  divergence  (AJ) 

1"  and  2-  QP 

0.09 

1*  and  3rd  QP 

0.01 

1"  and  4*  QP 

0.12 

2nd  and  3rd  QP 

0.09 

2nd  and  4*  QP 

0.004 

3rd  and  4*  QP 

0.12 

CQ2  and  CQ\  estimates.  Although  Sjjj  is  not  identical  to  S /  and  S/y  is  not  identical  to 
S//  it  has  been  found  experimentally  in  several  cases  that  the  simpler  CQ  2  estimate  (Eqn. 
(4.20))  gives  results  very  close  to  the  total  CQ 4  estimate  (Eqn.  (4.19)).  It  will  be  seen 
also  that  the  CQ  2  estimate  produces  results  that  are  very  similar  to  the  NSHP  results  and 
usually  requires  less  computation.  Further  when  the  model  parameters  are  computed 
directly  from  the  data  as  described  in  section  III  H  the  estimate  was  found  to  have 
improved  resolution. 

C.  EXPERIMENTAL  RESULTS  USING  NSHP  MODELS 

A  comprehensive  set  of  experimental  studies  for  simulating  the  entire  spectral  matrix 
is  presented.  The  estimate  includes  the  2-D  autospectrum  for  each  channel  and  magni¬ 
tude  and  phase  of  the  cross  spectra.  Estimates  using  NSHP  models  are  first  computed  for 
sinusoids  in  white  noise  backgrounds.  A  set  of  experiments  were  performed  to  determine 
the  performance  of  the  spectrum  estimation  procedures  as  a  function  of  model  order, 
dataset  size,  and  signal-to-noise  ratio.  Shift  estimation  experiments  are  then  carried  out 
to  test  for  linearity  and  accuracy  of  the  phase  estimate  of  the  cross  spectra.  Finally  an 
example  of  system  identification  is  given  to  further  test  the  cross  spectral  estimate  when 
used  to  identify  an  unknown  2-D  system  on  the  basis  of  Input/output  measurements. 

1.  Sinusoids  in  Noise  Background  Experiments 

In  this  section  the  NSHP  model  is  used  to  obtain  the  estimated  spectra  of 
sinusoidal  signals  buried  in  white  noise.  The  estimated  spectra  are  computed  and  plotted. 

a.  Tone  Estimation 

Here  we  discuss  the  estimation  of  a  sinusoid  in  a  white  noise  background. 
Three  numerical  examples  are  presented  which  illustrate  the  results. 

(1)  Example  1 

This  example  is  concerned  with  the  analysis  of  spectra  for  a  two- 
channel  2-D  single  sinusoid  signal  with  different  phase  in  additive  noise.  The  signals 
generated  in  channels  1  and  2  are  given  by  Eqn.  (4.18).  Spectrum  estimation  results  are 
given  for  a  third  order  NSHP  filter.  Two  cases  are  considered  in  this  example.  In  the  first 


case  we  suppose  that  the  two  channels  have  the  same  frequency  but  different  phase 

C0i  =  o>2  =  Jt/3,  0)3  =  0)4  =  tt/3,  and  0  =  1  radiar j 

while  in  the  second  case,  we  assume  the  two  channels  have  different  frequencies  and 
different  phase: 

g>i  =  o>2  =  Tt/2,  CO3  =  (O4  =  7t/3.  and  0  =  1  radiar^ 

Fig.  4.10  shows  the  results  for  the  components  of  the  spectral  matrix  in  the  first  case. 
Only  the  cross  term  S^o^.o^)  is  shown  (magnitude  and  phase)  since  the  term 
52i(Wi,(02)  is  theoretically  and  numerically  identical.  The  results  show  a  distinct  peak  in 
each  of  the  three  components  5n,  512,  and  •S'22  corresponding  to  the  location  of  the 
sinusoid.  The  center  of  the  peak  is  accurately  located  near  (7t/3,7t/3).  Although  the  phase 
of  the  cross  spectrum  shows  various  artifacts  around  the  edge  of  the  region  (where  the 
magnitude  is  small  and  the  phase  is  that  of  the  noise)  the  phase  at  the  location  of  the 
sinusoid  is  accurately  estimated  to  be  1.01  radians. 

Fig.  4.11  shows  S^.o^)  for  the  second  case  (different  frequencies). 
The  power  spectrum  estimates  of  the  first  and  second  channels  Sn  and  5 22  have  a 
single-peak  at  the  location  of  the  sinusoid.  Although  the  cross  spectra  should  theoreti¬ 
cally  show  no  presense  of  sinusoids  some  small  amount  of  energy  is  detectable  at  those 
frequencies  in  the  cross  spectrum.  Similar  effects  have  been  observed  in  1-D  multichan¬ 
nel  spectrum  estimation  and  have  been  attributed  to  non-exact  pole-zero  cancellations  in 
the  estimate  for  the  cross  spectrum  (Marple  and  Nuttall,1986). 

(2)  Example  2 

Two-channel  2-D  signals  with  two  sinusoids  in  additive  noise  are  con¬ 
sidered  in  this  example.  The  signals  in  the  channels  are  defined  by  Eqn.  (4.23)  with  the 
parameters  given  above. 

The  power  spectrum  estimation  results  for  a  second  order  NSHP  model 
are  given  in  Fig.  4.12.  The  results  are  close  to  the  theoretical  results.  Sj^cOj.co^  and 
S22(°h,a>2)  show  that  the  two  sinusoids  are  easily  resolved.  The  estimated  amplitudes 
are  unequal,  but  this  characteristic  has  been  observed  in  even  1-D  AR  spectrum 
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Estimate  of  spectra  for  sinusoids  at  same  frequency 


estimates.  The  cross-spectrum  5 12(o)i,ci)2)  shows  the  sinusoids  resolved  and  the  phase 
estimate  is  =  1.495  radians  and  <}>2  =  0.511  radians  which  is  close  to  the  true  phase  of 
<j>!  =  1.5  radians  and  <{>2  =  0.5  radians. 

(3)  Example  3 

In  this  example  we  considered  three  sinuosids  in  each  channel.  The 
location  of  the  sinusoids  is  shown  in  Fig.  4.13.  The  sinusoids  are  impeded  in  white  noise 
as  in  the  previous  examples.  Again  a  dataset  of  size  64x64  was  used.  The  phase  of  the 
sinusoids  in  channel  2  differ  from  those  in  channel  1  by  1  radian. 

Figure  4.14  shows  3-D  and  contour  plots  for  the  components  of  the 
spectral  matrix  estimated  using  a  fourth  order  NSHP  model.  The  cross-lines  (+)  in  the 
contour  plots  show  the  true  location  of  the  sinusoids.  The  results  show  a  very  slight  bias 
in  estimation  of  the  position  of  the  sinusoids  in  the  autospectra  and  the  cross  spectra  and 
good  resolution.  There  is  almost  no  evidence  of  energy  from  sinusoids  cofl  and  coD 
appearing  in  the  spectrum  of  the  opposite  channel  or  in  the  cross  spectrum.  The  phase  of 
the  cross  spectrum  shows  various  artifacts  but  since  these  correspond  to  the  noise  back¬ 
ground,  they  are  of  no  particular  interest.  However  in  the  region  where  the  peaks  are 
located,  the  phase  is  slowly  varying  and  estimated  with  values  of  <J>j  =  0.92  radians,  and 
<j)2  =  0.95  radians  (at  the  peaks).  The  slow  variation  of  the  phase  in  the  region  near  the 
sinusoids  makes  its  estimation  relatively  insensitive  to  any  inaccuracies  in  location  of  the 
spectral  peaks. 

b.  Effects  of  Model  order  and  Dataset  size 

A  comprehensive  set  of  experiments  was  carried  out  to  determine  perfor¬ 
mance  of  the  spectrum  estimation  procedures  as  a  function  of  model  order  and  dataset 
size. 

The  results  showed  that  to  some  extent  the  lack  of  resolution  resulting  from 
a  small  dataset  size  could  be  compensated  for  by  choosing  a  larger  model  order.  There  is 
a  limit  to  this  trade-off  however  since  a  larger  model  has  more  parameters  and  thus 
should  require  more  data  to  estimate  parameters  that  are  statistically  reliable.  The  exper¬ 
imental  observation  may  be  restated  in  the  following  way.  When  the  dataset  is  large  a 
lower  order  model  can  produce  results  that  are  comparable  to  a  higher  order  model. 


f  /  f 


Estimate  of  three  sinusoids 


As  a  specific  example,  two-channel  2-D  sinusoidal  signals  in  additive  noise 
were  again  considered  The  signals  used  are  defined  by  (4.23)  with  the  number  of 

sinusoids  L=2,  and  other  parameters  w11=a)12=^-,  <^21=^22="' »  coi3=coi4='!r- 

*  J  ^ 

a)23=(024=-^-,  <t>!=l  radian,  and  <^=1  radian.  The  simulation  was  performed  for  different 

values  of  the  model  order  and  also  for  different  values  of  the  dataset  size.  Figure  4.15 
shows  the  effect  of  model  order  on  the  cross  spectral  estimation  5,12(co1,co2)  amplitude 
and  phase  when  the  dataset  size  is  64x64.  For  a  first  order  model  we  have  poor  resolu¬ 
tion  and  the  two  peaks  appear  as  only  one  peak.  The  resolution  increases  with  increasing 
order  and  the  phase  gets  very  close  to  the  actual  value. 

For  a  dataset  size  of  32x32  (not  shown)  we  get  approximately  the  same 
power  spectral  estimation  results  as  for  a  dataset  size  of  64x64,  although  in  the  64x64 
dataset  we  get  more  sharpness  in  the  peaks. 

The  cross  spectral  estimate  for  a  dataset  size  of  16x16  is  shown  in  Fig.  4.16 
using  three  different  model  orders.  Autospectra  are  not  shown.  The  second  peak  starts  to 
be  seen  for  the  second  order  and  the  two  peaks  are  found  in  their  correct  location  for  the 
third  order.  Figure  4.17  shows  the  cross  spectrum  estimate  for  a  dataset  size  of  8x8. 
Only  one  peak  appears  for  first  and  second  order  estimates.  Two  peaks  appear  when  the 
order  of  the  model  is  increased  to  third  order,  but  the  peaks  are  not  sharp  at  all. 

The  results  of  these  experiments  can  be  summarized  as  follows.  For  a 
second  order  filter  the  results  of  spectrum  estimation  using  a  32x32  point  dataset  were 
essentially  the  same  as  those  using  the  64x64  point  dataset.  For  smaller  datasets  the 
results  degraded  considerably.  However  the  resolution  obtained  using  a  second  order 
NSHP  filter  on  a  16x16  point  dataset  was  similar  to  that  obtained  with  a  third  order  filter 
on  a  8x8  point  dataset.  For  the  8x8  point  dataset  the  resolution  is  greatly  improved  by 
going  from  a  second  to  a  third  order  filter.  However  there  is  a  noticeable  bias  along  the 
direction  in  the  location  of  the  peaks.  Phase  estimates  remain  quite  stable.  Table  4.4 
shows  some  selected  values  of  the  peak  locations  and  the  corresponding  estimated  phase 
values  at  those  peaks. 


Third  order 


Fig.  4.16  Effect  of  model  order  and  dataset  size  (16x16). 
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c.  Signal  to  Noise  Ratio  Experiment 

The  signal/noise  ratio  for  two  sinusoids  in  noise  was  defined  by  : 

C1  2 

SNR  =  10  log10  — —  ,  where  C  is  the  amplitude  of  each  sinusoidal  component  and  cr  is 

<r 

the  white  noise  variance.  The  results  of  the  cross  spectrum  estimation  for  a  64x64  point 
data  set  for  each  channel  with  sinusoids  of  frequencies  0l>h=<l>2i=~>  and  cd12=co22=~  are 

J 

shown  here.  The  model  used  for  these  experiments  was  second  order.  Estimated  values 
for  the  frequencies  and  phase  are  listed  in  Table  4.5.  At  a  SNR  value  of  12  dB  the 
sinusoids  are  completely  resolved  with  sharp  peaks  as  shown  in  Fig.  4.18.  At  a  SNR 
value  of  3.5  dB  the  peaks  begin  to  merge  (  see  Fig.  4.19)  and  at  0  dB  (Fig.  4.20)  the 
peaks  become  a  single  ridge  making  it  difficult  to  predict  that  there  are  two  sinusoids 
present.  The  phase  obtained  by  this  method  however  remains  nearly  constant  over  a 
wide  region  near  the  true  peaks  for  all  of  the  signal-to-noise  values.  The  phase  plots 
show  a  rather  slowly  varying  character  in  the  region  around  the  two  sinusoids. 

2.  Shift  Estimation 

This  section  contains  examples  of  multichannel  2-D  spectrum  analysis  involv¬ 
ing  narrowband  and  broadband  data.  These  cases  were  designed  to  test  the  accuracy  of 
estimating  a  linear  phase  term  in  the  cross  spectrum  and  the  ability  to  estimate 


TABLE  4.4  EFFECT  OF  DATA  SET  SIZE  ON 
THE  ESTIMATED  PARAMETERS 


Estimated  Parameters  [rad.  ] 

16x16  2nd  order 

8x8  2**  order 

8x8  3rd  order 

1.02 

undefined 

0.96 

0.97 

1.06 

1.04 

1.51 

undefined 

1.52 

1.57 

undefined 

1.64 

1.25 

1.24 

1.16 

True  Parameters  [rad] 
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parameters  of  a  known  2-D  random  process  from  data.  Three  different  numerical  experi¬ 
ments  are  considered. 


a.  Experiment  (1) 

For  this  experiment  two  channels  of  data  were  defined  by: 


xl(nl,n2)  =  wl(nx,n2)  (4.24a) 

x2(n1,n2)  =  C  xl(nx-dx,n1-d'2)+w2{nx,n1)  (4.24b) 

where  wx(nx,n 2)  and  w2(n  x,n2)  are  white  noise  terms  with  unit  variance  and  the  other 
parameters  are  0=0.5,^!  =  d2  =  1.  Spectral  estimates  were  generated  and  the  slope  of  the 
phase  of  the  cross  spectrum  was  measured  to  estimate  the  delays  dx  and  d2.  Fig.  4.21-a,b 
shows  the  autospectra  and  Fig.  4.21-c,d  shows  the  cross  spectrum  estimate  that  was 
obtained  using  a  second  order  filter.  The  magnitude  of  the  cross  spectrum  is  constant  and 
the  phase  shows  a  linear  dependence  with  slope  corresponding  to  dx  =  d2  =  1. 

b.  Experiment  (2) 

For  this  experiment  two-channels  of  real  data  were  defined  by: 


x\(nx,n2)  =/ (nx,n2) 


(4.25a) 


TABLE  4.5  EFFECT  OF  SNR  ON  THE  ESTIMATED 
PARAMETERS 


Estimated  Parameters  [rad.  ] 

True  Parameters  [rad] 

12  dB 

3.5  dB 

0  dB 

4>i 

1.01 

1.03 

1.04 

1.00 

$2 

0.99 

0.98 

0.98 

1.00 

ESI 

1.53 

1.50 

undefined 

n/2 

to12 

1.57 

1.50 

undefined 

n/2 

2(02! 

1.00 

1.04 

undefined 

Kf 3 

G>22 

1.00 

1.00 

undefined 

71/3 
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x2(n1,n2)  =  C  f  (nx-dl,n2-d2)  +  w(n1,n2)  (4.25b) 

where  f  (nx,n 2)  is  a  64x64  portion  of  the  image  shown  in  Fig.  4.22  (an  aerial  photo¬ 
graph  of  fields)  and  w{nx,n2)  is  a  zero  mean  white  noise  process.  Spectral  estimates 
were  generated  and  the  slope  of  the  phase  of  the  cross  spectrum  was  measured  to  esti¬ 
mate  the  shifts  dx  and  d2.  Figure  4.23  shows  the  phase  and  magnitude  of  the  spectral 
estimate  that  was  obtained  using  a  third  order  NSHP  model  for  signal  data  that  had 
parameter  values  of  C  =  0.5,  dj=3,  d2=5  and  white  noise  with  unit  variance.  The 
estimated  phase  shows  a  linear  dependence  with  slope  corresponding  to  ^=3.0,  d2=4.5 
which  rounds  to  integer  values  of  d  ^3,  d2= 5. 

c.  Experiment  (3) 

For  this  experiment  two  channels  of  sinusoidal  signals  are  considered 
L 

xl(nx,n2)  =  Xcos(n10)n+n2co1-2)  +  w1(n1,n2)  (4.26a) 

1=1 

x2(nl,n2)  =  C  xl(nl-dl,n2-d2)  +  w2(nx,n2)  (4.26b) 

where  wl(nl,n2)  and  w2(nxji‘$  are  zero  mean  independent  white  noise  signals,  and  L 
represents  the  number  of  sinusoids.  The  parameters  of  the  first  channel  are  given  in  sec¬ 
tion  la  Example  (3)  and  the  peak  locations  are  as  shown  in  Fig.  4.13b.  Spectrum  estima¬ 
tion  results  are  given  for  with  C  =  1  and  dx=d2=  1.  Fig.  4.24  shows  the  autospectrum 
of  the  two  channels,  while  Fig.  4.25  shows  the  magnitude  and  phase  of  the  cross  spec¬ 
trum.  The  peaks  are  shown  in  the  correct  location  and  the  phase  is  approximately  linear 
but  not  so  linear  as  in  the  other  previous  two  cases.  This  is  probably  because  the  magni¬ 
tude  of  the  spectrum  is  small  except  around  the  peaks  and  the  phase  in  this  case  is  more 
influenced  by  the  noise  except  at  the  peaks  location.  Nevertheless  the  phase  at  the  loca¬ 
tion  of  the  sinusoid  peaks  is  quite  accurately  estimated. 
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3.  System  Identification 
a.  Experiment  (1) 

The  purpose  of  this  experiment  is  to  generate  data  from  a  given  two- 
channel  2-D  difference  equation,  and  estimate  the  difference  equation  parameter  from 
these  data  by  using  linear  prediction  with  a  NSHP  model.  The  estimated  parameter 
coefficients  are  then  compared  with  the  true  parameters.  Our  investigated  process  is  the 
first  order  two-channel  2-D  difference  equation  given  by: 


x(n  j  ,n  2)  =  Aqj  x(/i  x  2- 1 )+ A_  t  _j  x(n  t  + 1  ,n  2+ 1  HA  j  ox(n  j- 1  ,n  2) 


(4.27a) 


+  Anx(/t  1-l,n2-l)  +  W(n  !,/i  2) 

A  data  array  2x(64x64)  is  generated  from  the  following  numerical  values: 


*0.6  O' 

'^(rt^-l)' 

x2(ni,«2) 

0.5  0 

x2(nhn2-l) 

(4.27b) 

2)' 

w2(«  i»/*2) 

where:  w,  (n  0  =1.2)  are  two  independent  white  noise  signals.  This  process  actually 
has  support  only  in  the  first  quadrant  but  a  first  order  NSHP  model  was  used  to  test  the 
estimation  procedure. 

Table  4.6  shows  the  result  of  estimating  the  parameters  with  a  (64x64) 
point  data  set.  The  estimates  for  the  non-zero  parameters  of  the  model  are  close  to  the 
given  values,  and  all  of  the  remaining  parameters  but  one  are  at  least  one  order  of  magni¬ 
tude  smaller.  The  estimated  spectral  components  for  this  process  are  shown  in  Fig.  4.26. 
Energy  is  spread  over  a  wide  range  of  frequencies  with  concentration  near  higher  values 
of  (Dj  and  lower  values  of  0)2.  This  is  consistent  with  the  signs  of  the  terms  in  the  defining 
equations  for  the  process. 


0  -0.7' 

+ 

0  0.4 

x2(nl-l,n2) 

+ 
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TABLE  4.6  MODEL  PARAMETERS  ESTIMATION  OF 
A  FIRST  ORDER  NSHP  MODEL 


Estimated  Parameters 


Ao,o 


1.00 

0.00 

0.00 

1.00 

^o.i 


A 


l.-l 


Ai.o 


0.564 

-0.027 

0.604 

-0.036 

'0.014 

0.143 

0.000 

0.022 

-0.051  -0.585 

0.012  0.352 


-0.017  -0.014 

0.034  -0.005 


True  Parameters 
'l.OO  0.00 
0.00  1.00 

0.60  0.00 
0.50  o.oq 

'o.oo  o.oo 
o.oo  o.oq 

'0.00  -0.70 

0.00  0.40 

'0.00  0.00 

o.oo  o.oq 


b.  Experiment  (2) 

The  goal  of  this  experiment  was  to  identify  an  unknown  system  H  (cd^cl^) 
on  the  basis  of  input/output  measurements.  Two-dimensional  white  noise  w(nl,n2) 
which  is  considered  as  a  first  channel  x  is  sent  through  the  unknown  system 

whose  output  x2(ni,«2)  is  considered  to  be  the  second  channel.  The  cross  spectrum 
between  the  channels  is  proportional  to  the  frequency  response  of  the  unknown  system. 
The  example  considered  here  corresponds  to  the  system: 

*i(n  x,ni)=xx(n  1(/t2)-0.38x  x,n  2-l)-0.38x  x(n  1-1,/i2)-0.24x  j(n  1-1,ii2-1) 

(4.28) 

Fig.  4.27(a)  shows  the  actual  firequency  response  of  the  system,  while  Fig.  4.27(b)  shows 
the  estimated  frequency  response  (amplitude  and  phase  of  the  cross  spectrum).  The 
estimated  system  parameters  are  given  in  Table  4.7.  Since  NSHP  support  was  assumed 
in  the  spectral  estimate  the  parameter  a  ^  comes  out  to  be  very  near  zero.  The  other 
parameters  in  Table  4.7  are  close  to  the  system  parameters  in  (4.28). 

D.  ESTIMATES  USING  CQ  MODELS 

This  section  discusses  the  results  of  the  CQ  method  on  the  examples  used  for  the 
NSHP  experiments  above.  The  results  are  found  to  be  quite  comparable.  Spectrum  esti¬ 
mates  for  Examples  1,  2,  and  3  were  also  computed  using  quadrant  plane  models.  The 
estimates  for  Examples  1  and  2  were  based  on  2x2  regions  of  support  (first  order  model) 


TABLE  4.7  COMPARISON  BETWEEN  THE  ACTUAL  AND  ESTIMATED 
PARAMETERS  OF  THE  SYSTEM  IDENTIFICATION  PROBLEM 


Estimated  Parameters 

True  Parameters 

1.00 

1.00 

<*0.1 

-0.4002 

-0.38 

<*1.-1 

-0.00001 

0.00 

<*1.0 

-0.3844 

-0.38 

<*1.1 

-0.1928 

-0.24 

(a)  Actual  frequency  response  (b)  Estimated  frequency  response 


for  the  first  and  second  quadrant  filters.  Figure  4.28(a)  shows  the  component  S  n  of  the 
spectral  matrix.  Results  for  S  22  are  similar.  The  use  of  either  the  first  or  second  quadrant 
filter  alone  results  in  a  spreading  of  the  peak  in  one  direction.  The  combined  estimate 
(Eqn.  (4.20))  gives  a  more  accurate  result  similar  to  that  of  the  NSHP  model.  Figure 
4.28(b)  shows  the  magnitude  and  phase  estimates  for  the  cross  spectrum  S 12.  A  similar 
spreading  phenomenon  is  observed  in  the  magnitude  estimate  but  the  estimation  of  phase 
is  correct  for  both  the  individual  and  the  combined  spectrum  estimates.  Figure  4.29 
shows  the  spectrum  estimate  for  two  sinusoids  in  white  noise  corresponding  to  Example 
2.  The  results  of  this  quadrant  based  model  can  be  qualitatively  compared  to  the  results 
for  the  NSHP  model  of  Figure  4.12.  The  placement  of  the  sinusoid  along  a  diagonal 
shows  some  characteristics  of  the  quadrant  models.  The  first  quadrant  results  show  a 
very  significant  spreading  of  the  peaks  along  a  direction  orthogonal  to  the  line  connecting 
their  centers.  This  is  observed  in  both  the  autospectral  components  and  the  cross  spectra. 
The  second  quadrant  estimates  show  good  resolution  with  little  spreading  of  the  peak. 
However  the  combined  estimate  gives  the  best  results  with  sharp  peaks  and  with  magni¬ 
tudes  more  nearly  equal  than  those  observed  with  the  NSHP  model.  The  phase  in  Fig. 
4.29(b)  is  slowly  varying  in  the  region  of  the  sinusoids  for  all  of  the  estimates  with  a 
correct  values  of  approximately  1.5  and  0.5  radians  at  the  locations  of  the  sinusoids. 
(Exact  values  produced  by  the  combined  estimate  are  1.42  and  0.57  radians  respectively.) 

Fig.  4.30  shows  the  results  for  estimation  of  the  three  sinusoids  given  in  Fig.  4.13 
using  combined  sixth  order  first  and  second  quadrant  filters.  These  results  can  be  com¬ 
pared  to  those  of  Figure  4.14.  Although  the  sinusoids  are  resolved  and  estimated  in 
approximately  the  correct  position,  the  peaks  are  less  sharp  than  for  the  NSHP  estimate. 
Computational  requirements  for  the  sixth  order  CQ  estimate  and  the  fourth  order  NSHP 
estimate  are  approximately  the  same. 

Figure  4.31  shows  the  amplitude  of  S12  estimated  by  using  the  first  and  second  qua¬ 
drant  filters,  separately.  We  have  seen  that  use  of  the  individual  quadrant  filters  can 
result  in  a  significant  bias  and  spreading  of  the  peaks.  In  the  particular  case  of  Fig.  4.31, 
we  notice  that  in  addition  to  the  bias  the  first  quadrant  estimate  for  the  cross  spectrum  has 
a  false  peak  near  the  frequency  cos  of  Figure  4.13.  The  second  quadrant  estimate  almost 
misses  the  frequency  co^ . 
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These  experiments  indicate  that  the  result  of  spectrum  estimation  using  a  single  qua¬ 
drant  model  is  not  generally  reliable  but  the  estimate  resulting  from  combining  the  two 
models  according  to  Eqn.  (4.20)  is  quite  accurate. 

E.  ESTIMATES  USING  THE  2-D  BURG  TECHNIQUE 

The  results  described  in  the  previous  sections  were  based  on  algorithms  that  first 
estimate  the  2-D  matrix  correlation  function  and  then  solve  Normal  equations  to  deter¬ 
mine  the  AR  model  parameters.  These  methods  will  be  called  indirect  since  they  require 
estimation  of  the  correlation  function  as  a  prerequisite  to  determining  the  model  parame¬ 
ters.  Here  the  2-D  multichannel  Burg  algorithm  cited  in  Chapter  in  is  used  to  form  the 
spectral  estimate. 

Two  test  cases  are  presented;  both  involve  sinusoids  buried  in  white  noise.  In  the 
first  test  case,  we  use  the  data  of  section  C,  Example  (1).  Fig.  4.32  shows  the  autospectra 
(  S  n,  S22  )  and  the  amplitude  and  phase  of  cross  spectrum  (5 12).  In  the  second  case,  we 
use  the  data  of  section  C,  Example  (2).  Fig.  4.33  shows  the  autospectra  and  the  ampli¬ 
tude  and  phase  of  the  cross  spectrum.  It  is  clear  that  we  have  a  good  estimate  for  single 
sinusoid  while  we  get  a  poor  estimate  when  the  number  of  sinusoids  increases.  This  is 
similar  to  what  happens  in  the  single  channel  2-D  Burg  algorithm. 

F.  ESTIMATES  USING  THE  DIRECT  CQ  METHOD 

A  direct  method  for  estimating  the  model  parameters,  i.e.,  a  method  that  estimates 
the  parameters  directly  from  the  data  was  described  in  Chapter  ID.  This  method  is 
very  well  suited  to  spectrum  estimation  using  combined  first  and  second  quadrant 
models  since  it  estimates  both  sets  of  filter  parameters  simultaneously. 

The  results  of  the  direct  CQ  method  on  the  examples  used  for  NSHP  and  CQ  experi¬ 
ments  above  are  found  to  be  quite  comparable.  Figure  4.34  shows  the  results  of  estimat¬ 
ing  a  single  sinusoid  in  white  noise  using  the  same  data  considered  in  section  C  (compare 
this  to  Fig.  4.28).  The  direct  method  yields  a  sharper  peak  with  somewhat  lower 
sidelobes.  Estimates  of  the  one  radian  phase  shift  between  the  channels  are  1.05  rad.  for 
the  indirect  method  and  1.03  rad.  for  the  direct  method. 
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Figure  4.35  shows  the  results  for  estimating  two  sinusoids  in  white  noise  using  com¬ 
bined  second  order  quadrant  filters  with  parameters  estimated  by  the  indirect  method. 
These  results  are  nearly  equivalent  to  those  shown  earlier  in  Figure  4.29.  A  few  minor 
peaks  appear  in  both  cases.  For  the  indirect  method  these  peaks  appear  closer  to  the 
main  peak  while  for  the  direct  method  they  appear  further  out  Phase  estimates  at  the 
peak  locations  are  1.48  and  0.50  radians  for  the  direct  CQ  method  while  they  are  1.47 
and  0.55  radians  for  the  indirect  method.  The  values  produced  by  the  direct  method  are 
slightly  closer  to  the  true  values  of  1.5  and  0.5  radians. 

Figure  4.36  shows  the  results  for  estimating  three  sinusoids  in  white  noise  using  two 
combined  sixth  order  quadrant  filters  with  parameters  estimated  by  the  direct  method. 
The  results  are  typical  of  those  obtained  in  other  examples.  The  sinusoids  are  resolved 
more  sharply  than  in  Fig.  4.30  and  are  quite  similar  to  the  NSHP  results  of  Fig.  4.14. 

The  results  for  the  shift  estimation  and  the  linear  model  examples  are  not  shown  here 
but  were  found  to  be  comparable  to  the  NSHP  and  CQ  results. 

An  important  advantage  of  the  CQ  method  over  the  NSHP  is  its  reduced  computa¬ 
tion.  This  is  discussed  further  in  the  next  section  where  a  quantitative  comparison  of  the 
estimates  is  given. 

G.  COMPARISON  RESULTS 


1.  Effect  of  Model  Order  on  Resolution 


In  order  to  compare  the  resolution  characteristics  of  the  NSHP  and  the  CQ 
method  quantitatively,  the  signal-to-noise  ratio  was  kept  constant  at  12  dB  and  the  model 
order  was  changed.  The  results  are  given  for  two-channel  2-D  signals,  each  channel  hav¬ 
ing  two  sinusoids  buried  in  white  noise  with  the  data  of  section  C,  Example  (2).  The  cri¬ 
terion  of  resolution  was  defined  as  the  minimum  frequency  separation,  below  which  two 
sinusoids  are  not  resolved.  Figure  4.37  summarizes  the  cross  spectrum  resolution  perfor¬ 
mance  of  the  methods  as  a  function  of  the  order  of  the  model.  As  shown  in  the  figure,  as 
the  model  order  increases,  the  resolution  increases.  Table  4.8a  shows  the  phase  of  the 
NSHP  estimate  as  a  function  of  the  model  order  while  Table  4.8b  shows  the  same  results 
for  the  CQ  method.  As  shown  in  the  table,  the  phase  estimate  is  good  even  when  the 
resolution  is  poor.  The  results  shows  an  estimate  which  is  consistently  close  to  the  actual 
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TABLE  4.8  PHASE  ESTIMATION  COMPARISON  OF 
DIFFERENT  AR  MODELS 


(a)  The  estimated  phase  of  NS  HP  method  as  a 
function  of  the  model  order. 


Model  Order 

♦i 

Y* 

3* 

1.489 

0.514 

4* 

1.487 

0.506 

5“ 

_ 

1.493 

— 

0.540 

(b)  Phase  estimation  of  CQ  method  as  a 
function  of  model  order. 


Model  order 


values  (<|>i=1.5  radians,  02=0.5  radians  )  and  varies  only  slowly  with  changes  in  the  order 
of  the  model. 

The  NSHP  estimate  has  a  somewhat  higher  resolution  than  the  CQ  estimate  for 
a  fixed  order  N  but  the  number  of  computations  required  is  considerably  higher.  In  par¬ 
ticular  if  we  assume  that  the  solution  of  the  Normal  equations  requires  a  number  of  com¬ 
putations  related  to  the  size  of  the  matrix  cubed  then  the  ratio  of  the  computations 
(NSHP/CQ)  is  approximately 

■  r  i  3 

[IN  (N+l)+  lj  /  (JV  +  1)2 

which  has  a  minimum  value  of  about  2  and  a  maximum  value  of  8  for  large  N.  If  the 
computations  required  for  solution  of  the  Normal  equations  is  proportional  only  to  the 
square  of  the  size  of  the  matrix,  then  the  ratio  varies  between  1.5:1  and  4:1.  If  we  con¬ 
sider  an  order  where  both  methods  have  approximately  the  same  resolution  we  can  com¬ 
pare  the  computation.  For  example  if  we  compare  the  3rd  order  NSHP  to  a  4!h  order  CQ 
estimate,  we  find  that  the  number  of  computations  is  about  the  same.  However  for  higher 
values  of  N  the  CQ  method  has  a  definite  computational  advantage. 

The  CQ  method  exhibits  a  slightly  but  consistently  better  resolution  when  the 
parameters  are  estimated  by  the  direct  method.  In  addition  since  the  computational 
requirements  are  proportional  to  the  square  of  the  order  of  the  matrix  in  Eqn.  (3.54),  the 
method  has  a  computational  advantage  as  well. 

2.  Divergence  Measurement 

In  this  final  section  we  apply  the  incremental  divergence  measure  (3.61)  to  the 
comparison  of  the  NSHP  and  CQ  spectral  estimates  for  three  examples.  The  results  are 
similar  to  those  in  Chapter  II  for  a  single  channel  2-D  estimate.  The  data  show  quantita¬ 
tively  that  the  CQ  method  gives  a  spectral  estimate  that  compares  closely  to  the  NSHP 
estimate  but  that  use  of  a  single  quadrant  model  alone  produces  an  estimate  that  com¬ 
pares  poorly  with  the  NSHP  estimate. 

In  the  first  example,  two-channel  2-D  signals  consisting  of  two  sinusoids  in  each 
channel  in  additive  noise  (tone  estimation  )  are  considered.  In  the  second  example,  we 
assume  a  two-channel  2-D  signal  with  linear  phase  difference.  Finally  in  the  third 


example,  we  consider  a  more  general  discrete  two-channel  2-D  AR  random  process  with 
NS  HP  support.  The  parameters  of  this  more  general  process  are  given  in  Table  4.6.  In 
all  cases,  equal  orders  were  used  for  the  NS  HP  and  quadrant  models. 

Table  4.9a  shows  the  results  of  incremental  divergence  of  NSHP  and  quadrant 
models  while  Table  4.9b  shows  the  results  of  incremental  divergence  of  the  NSHP  and 
quadrant  models  with  parameters  estimated  by  the  direct  method.  The  tables  show  that 
the  incremental  divergence  between  the  NSHP  model  and  the  individual  (  first  or  second 
)  quadrant  models  is  relatively  high  and  that  the  incremental  divergence  for  the  NSHP 
and  CQ  models  is  considerably  lower  indicating  a  closeness  of  the  spectral  estimates. 


Incremental  Divergence 

Comparison  between 

Ex.(l) 

Ex.  (2) 

Ex.(3) 

NSHP  &  l"  quadrant 

0.1735 

0.0628 

0.3857 

NSHP  &  2**  quadrant 

0.2576 

0.0943 

0.2762 

NSHP  &  CQ 

0.0657 

0.0262 

0.0869 

(b)  Comparison  between  NSHP  and  direct  CQ  models. 


Incremental  Divergence 


Comparison  between  Ex.(l)  Ex.(2) 


NSHP  &  l"  quadrant  0.1922  0.036 


NSHP  &  2**  quadrant  0.2584  0.0965 


NSH  &  direct  CQ 


Ex.(3) 


0.4147 


0.2728 


0.1046 


0.0157 


0.1229 


V.  MULTICHANNEL  2-D  MLM  SPECTRAL  ANALYSIS 

A.  INTRODUCTION 

The  maximum  likelihood  method  (MLM)  of  spectrum  estimation  was  originally  pro¬ 
posed  by  Capon  (1969)  in  a  multidimensional  case  as  an  array  processing  technique  and 
designed  to  accommodate  non-uniformly  spaced  data.  One  estimates  the  power  spectral 
density  by  effectively  measuring  the  power  output  of  a  narrowband  filter  as  this  filter  is 
swept  across  the  frequency  band  of  interest  (Lacoss,1971). 

It  was  seen  that  MLM  has  the  ability  to  resolve  closely  spaced  features  in  the  spec¬ 
trum  using  only  a  relatively  small  data  set.  Further,  although  the  MLM  has  resolution 
not  as  good  as  the  AR  spectral  estimation  method,  it  is  widely  used  in  practice,  especially 
in  array  processing.  In  addition  the  method  is  quite  simple  to  apply. 

In  this  chapter  we  review  the  2-D  single  channel  Maximum  Likelihood  Method 
(MLM)  of  spectral  estimation  and  describe  extensions  by  Baggeroer  and  Lagunas  et  al  to 
cross  spectrum  estimation  for  1-D  random  processes.  We  then  develop  the  MLM  method 
specifically  for  the  2-D  multichannel  case  and  compare  our  results  to  a  2-D  version  of  the 
earlier  results  where  the  components  of  the  spectral  matrix  are  computed  individually. 
We  discuss  the  differences  in  the  methods  and  compare  results  experimentally. 

Since  MLM  estimates  generally  have  significantly  poorer  resolution  than  AR 
model-based  estimates,  we  also  consider  the  so-called  Improved  MLM  (IMLM)  of  Lim 
and  Dowla  and  develop  it  for  the  multichannel  case.  This  method  retains  the  simplicity  of 
the  MLM,  but  gives  better  resolution.  We  carry  out  an  experiment  to  compare  the  ampli¬ 
tude  and  phase  of  cross  spectra  of  AR,  MLM  and  IMLM  techniques.  Finally,  in  this 
chapter  we  measure  and  compare  the  resolution  properties  of  the  estimate  experimentally 
as  a  function  of  SNR  and  as  a  function  of  model  order. 

B.  SURVEY  OF  SINGLE  CHANNEL  2-D  MLM 

In  the  1-D  case,  the  MLM  spectral  estimate  at  frequency  (Dq  is  determined  by  design¬ 
ing  an  FIR  filter  which  passes  the  power  in  a  very  narrow  band  around  0)9  and  rejects  the 


power  due  to  other  frequency  components.  The  same  interpretation  can  be  applied  to  2- 
D  spectral  analysis. 

Suppose  the  output  of  a  linear  2-D  FIR  filter  is: 


y  ("  i*"2)  =  L  Z  x  («  r*  i  'n  2-'  2) 


(5.1) 


where  a  is  the  region  of  support,  x  is  the  sampled  input,  and  aiiix  are  the  filter 
coefficients.  Suppose  further  that  we  wish  to  estimate  the  power  spectrum  S^n  (co^o^a) 
at  the  specific  frequency  (o)j0),o>^0)).  The  filter  weights  a(-  ,•  are  determined  by  minimiz¬ 


ing  E 


lyin^n-2) 


ponent  of  x(n,,n 
as  follows 


I  j  subject  to  the  constraint  that  the  filter  passes  the  frequency  com- 
2)  at  (Q)j(0),O)j0>)  with  unit  gain  and  zero  phase  shift.  The  formulation  is 


minimize 

^#1 


E 


l>*  («  1  2) 


2 


(5.2) 


subject  to  the  constraint 


I  « ■'-'V 


(5.3) 


The  spectral  estimate  Sw^  (CjL){0>.co40);Ot)  in  this  case  is  taken  to  be  the  average  power 


I  y  ( n  j  ,n  2 ) 1 


of  the  output  of  the  filter  with  the  calculated  filter  coefficients 


Then  we  can  say  that  the  MLM  estimates  the  power  spectrum  SML  (oq.OKCt)  by  design 
mg  at  each  frequency  (co1(0),a>j0))  a  different  FIR  filter  such  that  the  filter  passes  the  fre 
qucncy  component  ( o)lf04.a>A<))>  without  modification  and  minimizes  the  power  due  to 
other  interfering  spectral  components.  When  the  optimal  filter  is  determined  and  substi 
tuted  into  the  expression  for  the  output  power  (Lacoss,197 1 ).  the  resulting  estimate  takes 
the  simple  form 


(Wj.COjiQt) 


Rci‘  «a 


(5.4) 


where  Ru  is  the  correlation  matrix  corresponding  to  the  data  x(n,,ni)  in  a  in  Eqn.  <5.1) 


and  ea  is  the  correspondingly  ordered  vector  of  complex  sinusoids. 


=  (5.5) 

The  resolution  performance  of  the  MLM  spectral  estimate  actually  lies  somewhere 
between  that  of  the  Fourier  transform  technique  and  that  of  the  AR  method.  The  reason 
is  clear  from  a  relation  that  exists  between  the  MLM  and  AR  methods  which  is  cited 
below. 


1.  Relationship  Between  MLM  and  AR  Spectral  Estimation 

Burg  (1972)  showed  that  there  exists  a  simple,  exact  relationship  between  max¬ 
imum  entropy  spectra  and  maximum  likelihood  spectra  in  the  1-D  case,  when  the  correla¬ 
tion  function  is  known  at  uniform  intervals  of  lag  for  -K  <Jc  <K .  The  relationship  is 
given  bv: 


)  - 


—  £  - - - 

K  usi  5vf£(co;n  ) 


where  SUL(a):J( )  represents  the  MLM  spectral  estimate  of  order  K,1  and  Sw£(w,n) 
represents  the  MEM  spectral  estimate  of  order  n.  Eqn.  (5.6)  shows  that  the  reciprocal  of 
the  MLM  spectrum  is  equal  to  the  average  of  the  reciprocals  of  the  maximum  entropy 
spectra  obtained  using  from  one  point  up  to  K -point  prediction  error  filters. 

For  the  2-D  case  Dowla  and  Lim  ( 1984)  have  shown  that  there  is  an  exact  rela¬ 
tionship  between  the  MLM  spectra  and  the  spectra  obtained  by  AR  signal  modeling  using 
a  known  autocorrelation  sequence  Ru'j.iM  of  a  2-D  random  signal  in  the  region  a.  This 
relation  is 


CiKOt)  = 


S^kDi.OKV) 


where  if  all  samples  in  the  region  are  ordered  in  some  arbitrary  fashion  then 


That  is,  the  support  of  the  1-D  filter  is  a  set  of  K  adjacent  points. 
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^(©i.o^Y)  represents  the  spectrum  of  the  linear  predictive  filter  that  estimates  sample 
number  y  from  "previous"  samples  in  the  ordering.  The  sum  is  over  all  such  spectra. 


2.  Improved  MLM 

Recently  the  improved  MLM  (IMLM)  was  proposed  by  Lim  and  Dowla 
(1983,1985)  for  multidimensional  spectral  analysis  as  an  alternative  to  the  MLM.  The 
method  is  based  on  the  relation  that  exists  between  MLM  and  AR  spectral  estimates 
(Eqn.  (5.7)).  The  IMLM  algorithm  to  compute  the  spectral  estimate  S1ML  is  a  combina¬ 
tion  of  two  MLM  estimates  and  has  the  form  (Lim  and  Dowla,  1985): 


siml  (g>1,co2;<x,|3)  - - - - - -  ^ 

Sml(  ©i.G>2;a) 

where  at  frequency  (co1,co2),  SW£i(0)ltCi>2;a)  represents  the  MLM  spectral  estimate  based 
on  all  the  known  correlation  points  for  the  region  a  (Eqn.  (5.4)).  And  SWi(to1,o)2;p) 
represents  the  MLM  spectral  estimate  given  on  the  region  (3  where  P  is  a  subset  of  a. 


Sml«»  =  -rr~ j —  (5.9) 

ep  Rp  e3 

Lim  and  Dowla  proved  that  the  resulting  S/ml  (ci>i  ,c»>2;ot,P)  is  always  non-negative  when¬ 
ever  P  is  a  subset  of  a. 

To  illustrate  the  performance  of  the  IMLM  discussed  above,  we  present  one 
example.  Consider  a  single  channel  2-D  random  process  x(n^,n2)  consisting  of  two- 
sinusoids  in  white  noise  w(nx,n2)  as  given  in  Eqn.  (2.56)  with  the  same  parameters.  The 
correlation  matrix  is  developed  from  these  rea.  data.  Region  a  is  taken  to  be  the  rec¬ 
tangular  region  (-P  \,-P2)  <  (i  j,/ 2)  £  (P  \JP 2)  where  Pj  and  P2  are  chosen  to  be 
P 1  =  P  2  =  3.  Region  P  is  a  subset  of  a  and  formed  from  region  a  by  removing  the  point 
(P  \J*  2)  from  the  filter  support.  Figure  5.1a  shows  the  result  of  MLM  and  Fig.  5.1b 
shows  the  result  of  IMLM.  It  is  clear,  and  in  fact  not  surprising,  that  the  IMLM  has 
significantly  better  resolution  properties  than  the  MLM. 


C.  CROSS  SPECTRUM  ML  ESTIMATION  TECHNIQUE 

The  MLM  as  originally  developed  applies  to  multidimensional  signals  but  computes 
only  the  autospectrum  of  a  single  m-D  data  set.  Subsequent  efforts  were  made  by  Bag- 
geroer  (1975),  and  Lagunas  et  al  (1985)  separately,  to  estimate  the  cross  spectrum 
between  two  data  sets.  In  their  methods  each  channel  is  treated  separately.  We  refer  to 
this  method  as  the  cross  spectrum  procedure.  Our  goal  is  to  apply  these  methods  to  the 
2-D  case  and  compare  the  experimental  results  with  the  general  solution  given  in  the  next 
section. 

In  this  section,  we  review  the  cross  spectrum  procedure  as  it  would  apply  to  two  2-D 
random  processes  before  describing  our  more  general  extension  of  the  ML  spectrum  esti¬ 
mation  technique  to  the  multichannel  case. 

The  problem  of  obtaining  a  cross  spectral  estimate  for  two  stationary  random 
processes  xx(n  and  x2(rt  i,n^)  with  zero  means  as  developed  in  Baggeroer  (1975) 
and  Lagunas  et  al  (1985)  is  based  on  the  design  of  two  narrow-band  FIR  filters 
H ^a)!, CDj)  and  //2(o)i,co2)  centered  at  the  same  frequency  (col^.cop). 

If  the  input  to  //^CDj.co^  and  //2(co 1,9*2)  316  ^i(rti»«2)  and  x2(n1,n2)  respectively, 
then  the  outputs  )’i(nl,n2 )  and  y2(n  should  contain  the  spectral  components  at 
(£0<0),£of)  and  minimize  the  effects  of  spectral  components  at  other  frequencies.  Let  the 
filter  outputs  be  given  by 


r  r 

y\(nl,n2)=  X! 

(5  10) 

y2(«  i»rt2)  =  [aaJ  x2 

(5.11) 

where  Xj  and  x2  are  vectors  of  the  signal  values  x j (n  \-i\,n2-i2)  and  x2(n  \,n2-i2)  for 
(i  | ,/  2)e a  and  and  a^2'  represent  the  filter  coefficient  vectors  in  the  region  of  support 
a  for  the  filters  H  j  and  H  2. 

Capon  (1969)  has  shown  that  the  resulting  filter  coefficients  for  the  x,  and  x,  random 
processes  can  be  written  in  the  following  form: 


\  '■-r* 


(5.12) 


(5.13) 


where  Ru  and  R22  represent  the  correlation  matrices  of  the  two  random  processes 
Xj  and  x2,  respectively,  and  ea  is  defined  by  Eqn.  (5.5).  Then  the  cross  power  estimate  is 
taken  to  be 


512(co1,co2;a)  =  £: 


y  \  (n  i  >n  i)  y  2^n  un  2) 


,(D 


R.2a™ 


(5.14) 


where 


Rl2 


T 

X,  x[ 


(5.15) 


If  Eqns.  (5.12)  and  (5.13)  are  substituted  in  Eqn.  (5.14),  the  cross  spectrum  power  esti¬ 
mate  can  finally  be  written  in  the  form 


S  ^(Wj.ov.a) 


R-l 
“a  K11 


R 


e 


a 


(5.16! 


This  is  the  cross  spectrum  procedure. 


D.  DEVELOPMENT  OF  MULTICHANNEL  2-1)  MEM 

In  this  section  we  describe  a  more  general  extension  of  the  MI  NI  spectra!  estimation 
procedure  to  the  M-channc!  case  In  this  case  the  all  components  of  the  spectral  matrix 


5u(o)i,co2;a)  S^Cco^a)  .  .  .  S^coj.co^a) 
S2i(0)i,(02;a)  S22(w1(co2;a)  .  .  . 


SWL(c°i-0)2;a)  = 


(5.17; 


•  •  •  $MM  (CDj.CD^a) 


are  computed  at  once. 

Suppose  we  are  given  samples  of  a  2-D  multichannel  random  process  x(n  ;,n2)  over 
some  region  of  the  plane.  This  signal  is  to  be  processed  by  a  multichannel  FIR  nar¬ 
rowband  filter  to  obtain  an  estimate  of  the  power  matrix  at  frequency  ( co{°\co^0>').  The 
output  y (n  x,n  2)  of  this  filter  is  given  by  : 


y(nx,n2)=  I  (5.1Ki 

(*,.*2jea 

where  a  represents  the  region  of  support  and  A tiij  are  Mx.vf  matrix  coefficients.  Assume 
that  the  input  to  the  filter  is  a  complex  sinusoid  given  by 


xtnj.fli)  =  ce' 


( 0>|l<Tln , 


(5. 19 1 


where  c  is  constant.  We  want  to  choose  the  filter  weights.  At  ,  such  that  the  filter 
passes  the  signal  in  each  channel  with  zero  phase  and  unit  gain  and  there  is  no  cross  cou¬ 
pling.  That  is.  we  require 


■  ux  n  *  ux  >1 .  > 


yini,nu  =  ce 

or  equivalently  that  the  frequency  response  Huo,n‘.w;'ui  satisfio 


5  20 1 


M  i  M  'li  v — .  » — •  4  uv  t  *  n>*  i  - 

1  to.  .to-  2.  2.  A,  (  c 

«  .« <11 


w  here  I  is  the  i  M ■  M  ulentitv  matrix  Hus  constraint  1  hqn 


■ 1  can  be  expressed  .0 


I  =  Ea*TAa 


(5.22) 


where  Ea  is  the  block  matrix  of  complex  sinusoids  defined  by  the  direct  product 

Ea  =  ea  0  I  (5.23) 

where 


ea  =  Col 


eJ«^n-L  *  ayii) 


(5.24) 


and  Aa  represents  the  correspondingly-ordered  block  matrix  of  filter  coefficients.  The 
filter  output  power  matnx  is  given  by 


where 


F  =  £ 


y<rt,,*2)yr(n1,rt  2) 


=  Aa^Ra  Aa 


(5.25) 


Ra  =  £ 


*a  xo 


(5.26) 


and  xa  is  the  ordered  vector  of  the  given  samples.  The  MLM  spectra]  estimate  is  defined 
as  the  power  matnx  of  the  filter  that  minimizes  (5.25)  subject  to  the  constraint  (5.22). 

The  general  solution  to  this  problem  which  yields  the  optimal  set  of  filter  coefficients 

is 


Aa 


It  this  is  substituted  in  Eqn.  (5.25)  the  spectral  estimate  can  be  written  as 


S w  (u^.uxa)  = 


R„  Er 


(5.27) 


(5  28) 


To  show  the  specific  form  of  this  result  for  a  particular  case,  suppose  the  filter  has 
rectangular  support  of  order  (P  ,  .P ;).  Then  the  matnx  Aa  is  defined  by: 


9 


v 
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vj.-v.yj-k-  yn  <,-v  yv  ,-v  yv  yy.-v  yv  -.-s 


where 


Ai  = 


A(*  .0) 

A(«.i) 


A(«i*i) 


and  each  submatrix  A(i  n  is  given  by 


(5.29) 


(5.30) 


’BoofiJ)  ao\(ij)  •  •  •  a0M(i,jy 

aioiij )  flnOV)  •  •  •  aw(ij) 


aMo(ij) 


(5.31) 


The  matrix  of  complex  exponentials  Ea  is  now  defined  by  the  Kronecker  product 


E«x  =  (e2  ®  *i)  ®  I 


(5.32) 


where  et  and  e2  are  given  by 


i. 


R(0)  R(-l)  .  .  .  R( -PJ  ' 

R(l)  R(0)  .  .  .  Ri-Px+l) 

(5.37) 

R(Pj)  R(/V1)  .  .  .  R(0) 

where 

K(*,0)  R(k,-l)  .  .  .  Rik-P?)  ' 

R(k,l)  R(k, 0)  .  .  .  R(k-P2+l) 

(5.38) 

R(kJ> 2)  R(kP 2-l)  .  .  .  R(k, 0) 

and  each  submatrix  R(k ,/)  is  the  2-D  (MxAf)  matrix  correlation  function. 

In  order  to  check  the  performance  of  this  technique,  two  different  examples  are  con¬ 
sidered  here  and  the  results  are  compared  with  the  cross  spectrum  ML  estimate  given  in 
the  previous  section. 

In  the  first  example,  we  compare  the  results  of  autospectrum  estimation  using  the 
single  channel  MLM  and  the  results  of  cross  spectrum  estimation  between  two  2-D  data 
sets  obtained  by  applying  the  cross  spectrum  procedure  to  the  results  of  the  multichannel 
MLM  developed  here.  Two  data  records  representing  the  two  channels  are  given  by 
Channel  (1): 

L 

xl(n1,n2)=  '£cos(n10)il+n2(Oi2)  +  wl(nl,n2)  (5.39a) 

i= 1 

Channel  (2) : 

L 

x2{n  j,n2)  =  £c°s(/t  1a)i3+n2coi4+  <{>,•  )  +  w2(nl,n7)  (5.39b) 

i= l 


m 


where  and  ^(n^n^  are  zero  mean  independent  white  noise  signals  and  L 

represents  the  number  of  sinusoids.  Spectrum  estimation  results  are  given  for  a  dataset 
size  of  64x64  and  rectangular  support  with  P  ^  =  P2  =  5.  We  assume  L=2,  with 
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0)11=0)12=~,  C02i=^022=Y’  C0i3;=t0H='7>  Gi23=C024=:T’  4>i=l-5  radians,  and  <$>2=0.5  radi- 

ans.  Fig.  52  shows  the  autospectrum  estimate  of  the  first  and  second  channels  using  the 
single  channel  MLM  and  Fig.  5.3  shows  the  autospectrum  (Su522)  components  of  the 
spectral  matrix  from  the  multichannel  MLM.  Fig.  5.4  shows  the  cross  spectrum  estima¬ 
tion  results,  amplitude  and  phase,  using  the  cross  spectrum  procedure  while  Fig.  5.5 
shows  the  results  of  the  cross  spectrum  component  for  the  multichannel  MLM.  The 
estimated  phase  at  the  corresponding  peaks  for  the  cross  spectrum  method  are  (^  =  1.480 
radians,  and  <j>2  =  0.530  radians  while  they  are  1.481  and  0.529  radians  in  case  of  mul¬ 
tichannel  MLM  respectively.  The  actual  values  are  <p1  =  1.5  radians  and  (fo  =  0.5  radians. 
The  results  for  this  example  as  in  others  where  the  noise  in  the  two  channels  is  uncorre¬ 
lated,  were  found  to  be  very  close. 

In  the  second  example  we  make  two  changes  to  the  data.  First  the  location  of  the 
sinusoids  are  move  closer  to  each  other  (co1  i=C0ii=^- ,  0>2i=a>22=~~,  ai3=fflu=r>  and 

0)23=0324=^-)-  Secondly  the  noise  in  the  two  channels  was  made  highly  correlated  with 


covariance 


[  1  0. 

^  =  0.9  1 


and  zero  mean.2  Fig.  5.6  shows  the  cross  spectrum  estimation  results  using  the  cross 
spectrum  procedure  while  Fig.  5.7  shows  the  5 12  component  using  multichannel  MLM. 
The  peaks  are  resolved  more  sharply  in  Fig.  5.7  than  in  Fig.  5.6. 

E.  IMPROVED  MLM  POWER  SPECTRAL  ESTIMATION 

The  MLM  has  significantly  poorer  resolution  than  the  AR  method,  but  it  is  widely 
used  in  practice  due  to  its  relative  simplicity.  As  stated  earlier  Lim  and  Dowla  (1983) 
have  proposed  an  improvement  to  the  MLM  method  based  on  the  relation  that  exists 
between  the  MLM  and  AR  spectrum  estimation  methods  (Dowla  and  Lim,  1 984).  Their 


Observe  that  the  noise  is  still  "white"  noise  since  it  is  spatially  uncorrelated. 
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idea  can  be  applied  to  our  procedure  for  estimation  of  the  multichannel  2-D  spectral 
matrix. 

Let  the  points  in  the  region  a  be  divided  into  two  subsets  (J  and  a-($.  Then  the 
"improved"  maximum  likelihood  method  (IMLM)  for  spectral  matrix  estimation  is 
defined  by 

f  'i  -l 

S/AfL(a)l.“2>a’P)=  SjiJ>((D1,<B2;a)-S^i>(Q>1,CD2;p)  (5.40) 

where  S^Cco^ojjja)  represents  the  2-D  multichannel  MLM  spectral  estimate  based  on 
all  the  known  correlation  points  in  the  region  a  (see  Eqn.  (5.28)),  and  S^(c0j, o^ip)  is 
the  ML  spectral  matrix  estimates  using  support  region  P 

Sml&iWP)  =  [EpTRp-^Ep]  _1  (5.41) 

The  IMLM  of  Lim  and  Dowla  has  demonstrated  higher  resolution  properties  than  the 
MLM  for  single  channel  2-D  data.  This  property  will  be  seen  to  carry  over  to  the  estima¬ 
tion  of  the  entire  spectral  matrix  using  (5.40).  Three  different  examples  are  considered 
here. 


1.  Example  I 

In  this  example  we  carried  out  an  experiment  to  compare  the  autospectra  and  the 
amplitude  and  phase  of  cross  spectra  of  the  MLM  and  IMLM  techniques.  In  the  two 
methods  we  considered  a  2-D  two-channel  real  process  \{nx,n2)  consisting  of  sinusoids 
buried  in  white  noise.  The  two  data  records  which  represent  the  two  channels  are  given 
by  Eqn.  (5.39)  with  the  same  parameters  cited  following  that  equation.  The  results  are 
given  for  PX=P2= 4.  Fig.  5.8  shows  the  auto-  and  cross  spectra  results  of  the  MLM 
method.  Fig.  5.9  shows  the  results  of  IMLM.  This  example  shows  the  higher  resolution 
properties  of  the  IMLM  relative  to  the  MLM.  The  values  of  the  phase  of  the  cross  spec¬ 
tra,  at  the  peak  locations  for  each  of  the  two  techniques,  are  close  to  each  other  and  very 
close  to  the  actual  values. 

We  have  considered  many  similar  examples  with  different  choices  of  the  model 
order  and  the  peak  positions.  In  all  cases,  we  observed  that  IMLM  provides  significantly 
better  resolution  than  MLM. 


2.  Example  2 

The  main  purpose  of  this  experiment  is  to  identify  an  unknown  system 
//(©!, 0)2)  by  applying  multichannel  2-D  MLM  and  IMLM  techniques.  In  this  example 
the  first  channel  is  a  zero  mean  white  noise,  while  the  second  channel  is  the  output  of  the 
unknown  2-D  system  which  is  driven  by  the  noise  in  the  first  channel.  We  consider  here 
the  specific  example  given  by  Eqn.  (4.28)  with  the  same  parameters. 

Fig.  5.10a  shows  the  amplitude  and  phase  of  the  estimated  frequency  response 
applying  the  IMLM  technique  while  Fig.  5.10b  shows  the  results  using  the  MLM  tech¬ 
nique.  The  two  results  are  similar  to  each  other  in  this  specific  example.  However  com¬ 
paring  these  results  with  the  true  frequency  response  given  in  Fig.  4.27a  we  find  that  the 
IMLM  results  are  closer  to  the  true  results  than  the  MLM. 

3.  Example  3 

The  third  test  example  is  concerned  with  estimation  of  a  2-D  linear  phase.  The 
signal  in  the  first  channel  consists  of  white  noise,  and  the  signal  in  the  second  channel  is 
a  delayed  version  of  the  first  channel.  The  specific  example  shown  here  is  given  by  Eqn. 
(4.24)  with  the  same  parameters. 

In  this  experiment  the  delay  between  the  two  channels  was  measured  by 
estimating  the  phase  of  the  cross  spectrum.  The  results  of  the  spectral  matrix  com¬ 
ponents  of  the  MLM  and  the  IMLM  are  shown  in  Figs.  5.1 1  and  5.12  respectively.  The 
results  were  found  to  be  very  close  to  each  other.  The  autospectra  and  magnitude  of 
cross  spectra  are  constants.  The  phase  of  the  cross  spectrum  is  linear  with  slope 
corresponding  to  dj=^2=l  as  we  expect. 

F.  NUMERICAL  EXAMPLE  AND  COMPARISON  RESULTS 

In  order  to  compare  the  resolution  and  phase  estimation  characteristics  of  the  AR, 
MLM  and  IMLM,  we  carried  out  additional  experiments  to  compare  the  autospectra  and 
the  amplitude  and  phase  of  cross  spectra  of  the  three  techniques.  In  all  three  methods,  we 
assumed  a  2-D  two  channel  random  process  with  the  data  for  each  channel  consisting  of 
sinusoids  buried  in  white  noise.  We  assumed  three  sinusoids  with  the  position  of  the 
peaks  as  shown  in  Fig.  4.13  and  with  the  same  parameters. 
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Fig.  5.13  shows  the  auto  and  cross  spectra  results  of  the  MLM  method.  Fig.  5.14 
shows  the  results  of  IMLM.  The  dataset  size  is  (64x64)  for  each  channel  and  the  order  of 
the  model  is  four.  The  results  can  be  compared  to  those  obtained  in  Chapter  IV,  (see  Fig. 
4.14).  Such  a  comparison  results  shows  the  high  resolution  properties  of  the  AR  relative 
to  the  MLM.  It  also  shows  that  EMLM  has  resolution  superior  to  the  MLM,  and  that  it  is 
very  close  to  that  of  the  AR.  The  values  of  the  phase  estimate  of  the  cross  spectra,  for 
each  of  the  three  techniques,  are  close  to  each  other  and  very  close  to  the  actual  values. 

We  have  considered  many  different  variations  of  this  example  with  different  choices 
of  the  model  order,  number  of  sinusoids  and  peak  positions.  The  results  show  a  con¬ 
sistently  higher  resolution  for  AR  and  IMLM  than  for  MLM. 

In  the  remaining  part  of  this  section  we  measure  and  compare  the  resolution  proper¬ 
ties  of  the  three  methods  quantitatively.  The  resolution  properties  are  studied  both  as  a 
function  of  the  signal  to  noise  ratio  (SNR)  and  as  a  function  of  the  order  of  the  model. 

For  the  following  studies,  a  two  channel  2-D  random  process  is  again  used.  Each 
channel  has  two  sinusoids  buried  in  white  noise  in  the  form  of  Eqn.  (5.39).  The  criterion 
for  resolution,  as  defined  before  in  Chapter  IV,  is  the  minimum  frequency  separation 
above  which  two  sinusoids  are  not  resolved.  Sinusoids  are  considered  to  be  resolved  if 
there  are  two  peaks  in  the  spectrum  with  a  valley  between  them.  The  smaller  the  value 
of  the  separation,  the  better  the  resolution. 

The  experimental  procedure  is  as  follows.  The  location  of  one  sinusoid  is  kept  con¬ 
stant,  then  the  position  of  the  peak  of  second  sinusoid  is  changed  slightly.  Starting  with 
the  frequency  of  the  second  sinusoid  very  close  to  the  first,  we  find  the  spectrum  of  the 
two  sinusoids.  If  the  two  peaks  appear  as  one  peak  we  move  the  second  sinusoid  slightly 
away  from  the  first  one  and  recomput  the  spectrum.  We  repeat  this  procedure  until  we 
obtain  a  valley  between  the  two  peaks. 

I.  Effect  of  Model  Order  on  Resolution 

In  order  to  compare  the  resolution  characteristics  of  the  NSHP,  MLM,  and 
IMLM  quantitatively,  the  SNR  is  kept  constant,  SNR  =  0  dB,  and  the  model  order  is 
changed.  Fig.  5.15  summarizes  the  cross  spectrum  resolution  performance  of  the  three 
methods  as  a  function  of  the  order  of  the  model.  As  shown  in  the  figure,  the  resolution 
increases  with  increasing  model  order.  The  NSHP  AR  model  method  has  the  highest 
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resolution,  while  the  MLM  has  the  poorest  resolution.  The  resolution  of  the  IMLM  is 
close  to  that  of  the  NSHP. 


Table  5.1  shows  the  phase  estimate  of  the  cross  spectrum.  It  is  clear  from  the 
table  that,  all  three  methods  produce  good  phase  estimates  that  are  close  to  the  true 
values. 


2.  Effect  of  SNR  on  Resolution 


The  experiment  just  described  was  carried  out  again  except  that  this  time  the 
model  order  was  kept  constant  (equal  to  3)  and  the  SNR  was  changed.  Fig.  5.16  sum¬ 
marizes  the  cross  spectrum  resolution  performance  of  the  NSHP,  IMLM,  and  MLM  esti¬ 
mates  as  a  function  of  the  SNR.  The  results  show  clearly  that  IMLM  has  a  resolution 
which  is  much  better  than  the  MLM  and  quite  close  to  but  not  as  good  as  the  NSHP  AR 
method. 

The  phase  estimation  results  for  the  cross  spectrum  are  shown  in  Table  5.2  .  We 
have  a  good  phase  estimate  for  SNRs  greater  than  -6  dB,  and  a  poor  estimate  for  SNRs 
less  than  -6  dB. 


TABLE  5. 1  CROSS  SPECTRUM  PHASE  ESTIMATION  AS  A 
FUNCTION  OF  MODEL  ORDER 


(a)  The  estimated  values  of  <|)1 


Phase  Estimation  (<J>j) 

Method 

Model  Order 

2 

3 

4 

5 

NSHP 

1.464 

1.478 

1.476 

1.520 

MLM 

1.450 

1.452 

1.456 

1.467 

IMLM 

1.543 

1.456 

1.433 

1.426 

(b)  The  estimated  values  of  <J>2 


Phase  Estimation  (<j>2) 

Method 

Model  Order 

2 

3 

4 

5 

NSHP 

0.554 

0.554 

0.518 

0.524 

MLM 

0.513 

0.558 

0.508 

0.477 

IMLM 

0.550 

0.521 

0.490 

0.533 

TABLE  5.2  SNR  EFFECT  ON  THE  ESTIMATED  PHASE 


(a)  4>!  Estimation 


Phase  Estimation  (c^) 

Method 

SNR  (dB) 

-12 

-6 

0 

6  12 

NSHP 

1.250 

1.409 

1.478 

1.480 

1.489 

MLM 

1.412 

1.435 

1.452 

1.472 

1.436 

IMLM 

1.332 

1.455 

1.456 

1.452 

1.498 

(b)  ^Estimation 


Phase  Estimation  (<f>2) 

Method 

SNR  (dB) 

-12 

-6 

0 

6 

12 

NSHP 

0.493 

0.462 

0.518 

0.510 

0.514 

MLM 

0.496 

0.478 

0.557 

0.510 

0.530 

IMLM 

0.402 

0.510 

0.521 

0.491 

0.485 

VI.  APPLICATIONS  OF  SPECTRAL  ANALYSIS 
MODELS  TO  IMAGE  CODING 

A.  INTRODUCTION 

Image  coding  is  a  fairly  new  subject  which  has  been  practiced  over  the  last  15  years. 
Many  efforts  have  been  made  towards  the  implementation  of  digital  image  coding. 
Image  coding  applications  can  be  found  in  television  and  facsimile  transmission  of 
printed  materials  and  in  the  continuous  source  of  weather  photographs  and  earth  resource 
pictures  sent  to  earth  by  satellites.  In  addition  applications  can  be  found  in  the  military 
field  in  areas  such  as  the  control  of  remotely  piloted  vehicles. 

Efficient  coding  for  these  application  can  provide  a  substantial  decrease  in  the 
transmission  costs.  As  an  example,  broadcast  digitized  television  signals  need  about  100 
Mb/s  which  is  a  very  high  data  rate.  Efficient  coding  can  bring  this  requirement  down  to 
a  few  hundred  kb/s.  The  main  purpose  of  coding  therefore,  is  to  decrease  the  transmitted 
bit  rate,  but  to  maintain  a  certain  acceptable  level  of  fidelity  (Pratt,  1978;  Gibson,  1980; 
Goodman  and  Gersho,1974;  Jain,1981;  Maragos  et  al,1984;  Netravali  and  Limb,1980; 
Jayant,1974,1976;  Jayant  and  Noll, 1984;  Zetterberg,1982;  Makhoul,1975;  Linde  et 
al,1980;  Gray, 1982;  Gray  and  Linde,1984;  Cuperman,1985;  Hang  and  Woods,1985; 
Goldberg,  1986).  The  required  image  quality  and  intelligibility  vary  widely  depending  on 
the  application.  For  example,  in  remotely  piloted  vehicle  applications,  the  image  intelli¬ 
gibility  is  more  important  than  the  quality,  while  in  applications  such  as  digital  televi¬ 
sion,  high  quality  is  essential. 

An  overall  image  coding  and  transmission  scheme  is  shown  in  Fig.  6.1.  The  digital 
image  is  encoded  by  an  image  coder.  The  output  of  the  image  coder  is  a  sequence  of  bits 
which  represents  the  source  image.  The  channel  coder  transforms  this  sequence  of  bits  to 
a  form  suitable  for  transmission  over  a  communication  channel  by  using  the  modulator 
which  maps  the  code  outputs  1  and  0  into  analog  signals.  The  demodulator  at  the 
receiving  side  uses  an  appropriate  decision  threshold  to  interpret  received  signals  as 
corresponding  to  either  1  or  0.  The  image  decoder  reconstructs  the  image  from  this 
sequence  of  bits. 
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The  image  coder  can  be  thought  of  as  having  three  stages.  The  initial  stage  of  image 
coding  provides  a  good  representation  of  the  signal.  This  operation  is  generally  reversi¬ 
ble.  The  type  of  coding  used  in  this  stage  can  be  classified  into  three  categories,  namely, 
waveform  coders,  transform  coders,  and  image  model  coders.  In  the  second  stage  the 
accuracy  of  representation  is  reduced,  but,  in  a  way  that  maintains  the  required  image 
quality.  This  stage  is  called  quantization.  The  quantization  levels  and  the  corresponding 
receiver  decision  levels  are  system  parameters  that  must  be  determined.  Two  types  of 
quantizers  are  used,  namely  scalar  and  vector  quantizers.  Both  types  of  quantization  are 
irreversible.  The  last  stage  in  which  any  redundancy  in  the  signal  is  removed  is  called 
codeword  assignment  Two  types  of  codewords  can  be  used,  namely  equal-length  code¬ 
words  and  variable-length  codewords.  In  the  case  of  equal-length  codewords,  the  bit  rate 
of  individual  signal  values  and  the  average  bit  rate  are  the  same,  while,  in  the  case  of 
variable-length  codewords  they  are  not.  In  the  latter  case,  different  bit  rates  result  for 
different  signal  values.  As  an  example,  the  shortest  codeword  may  correspond  to  the 
most  probable  signal  value,  i.e.  that  which  occurs  most  frequently.  The  longest  code¬ 
word  may  correspond  to  the  least  probable  signal  value.  The  average  bit  rate  may 
increase  in  this  case.  The  codeword  assignment  operation  is  reversible. 

In  this  chapter  the  2-D  spectral  analysis  models  developed  earlier  are  applied  to  the 
coding  of  color  images.  We  are  concerned  here  with  predictive  coding,  i.e.  linear  pred¬ 
iction  followed  by  quantization  of  the  prediction  error.  Two  algorithms  are  compared 
initially.  In  the  first,  the  whole  frame  of  the  image  is  divided  into  subframes  and  the 
predictor  coefficients  are  computed  separately  for  each.  In  the  second,  predictor 
coefficients  are  obtained  for  the  whole  frame  of  the  image.  Both  of  these  methods  have 
the  disadvantage  that  the  linear  prediction  coefficient  matrices  must  be  computed  in  real 
time  and  transmitted  to  the  receiver  as  side  information.  This  significantly  increases  com¬ 
plexity  of  the  coding  system.  As  an  alternative  we  consider  using  a  fixed  set  of  predic¬ 
tion  matrices,  i.e.  ones  that  do  not  depend  on  the  specific  image  being  coded.  In  this  way 
both  receiver  and  transmitter  have  the  linear  prediction  matrices  and  no  side  information 
has  to  be  transmitted.  Such  prediction  matrices  can  be  generated  by  various  averaging 
methods  discussed  in  the  chapter.  We  compare  the  results  of  this  coding  to  that  resulting 
from  the  previous  two  methods. 
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B.  PREDICTIVE  CODING 

Predictive  coding,  (also  known  as  differential  pulse  code  modulation  (DPCM)) 
involves  2-D  linear  prediction  applied  to  the  image  and  transmission  of  the  quantized 
error  residuals.  The  prediction  of  the  sample  to  be  encoded  in  predictive  coding  is  gen¬ 
erated  from  the  previously  coded  samples  (Jayant,1974;  Netravali  et  al,1980).  For  our 
case  we  will  be  dealing  with  the  three  (red,  green,  and  blue)  components  of  a  color  image 
so  the  prediction  process  is  multichannel  2-D  linear  prediction.  The  vector-valued  error 
e(nl,n2)  that  is  the  difference  between  the  predicted  value  and  the  actual  value  of  the 
color  element  x(n  is  quantized  into  one  of  a  number  of  given  discrete  levels.  These 
levels  are  assigned  to  codewords  which  may  be  fixed  or  variable  depending  on  the  type 
of  coding  used.  The  encoded  information  is  then  transmitted  through  the  channel  to  the 
receiver. 

The  signal-to-noise  ratio  (SNR)  for  a  given  bit  rate  is  actually  increased  by 
representing  the  coded  input  signal  in  terms  of  the  prediction  error  samples.  In  this  case 
the  predictive  gain  can  be  written  as: 

G  =  £  x(n  ltn 2)  (n  2^  |£,[e(n1,n2)er(«i^2^|  (6-1) 

Note  that  G  is  an  MxM  matrix  where  M  represents  the  number  of  2-D  channels  (M=3  for 
the  color  images). 

Fig.  6.2  shows  the  general  predictive  coding  system.  Fig.  6.2(a)  shows  the 
transmitter  part  of  the  system  while  Fig.  6.2(b)  shows  the  receiver  part.  Since  our  con¬ 
cern  here  is  with  the  image  encoding  and  decoding  problem,  we  have  omitted  the  modu¬ 
lator  and  the  demodulator  from  Fig.  6.2.  It  is  clear  from  the  figure  that  the  predictive 
coder  has  three  main  components,  namely  the  linear  predictor,  the  quantizer,  and  the 
channel  encoder.  In  fact  the  linear  predictor  is  the  main  part  of  predictive  coding  system 
and  the  complexity  of  this  system  depends  on  the  type  of  prediction  algorithm  used. 

The  basic  predictive  coding  equations  can  be  written  as  follows 


e(n  j.n  2)  =  x(n  j,/i2)  -  x{nx,n  2) 


e<7 ( n \,n2)  =  e(n \,n2)~  q(n  t,n2) 


(6  2a) 


(6.2b) 
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(b)  Receiver 


Block  diagram  of  predictive  coding 


X(rt1,/l2)=*('»l.*2)  + V',l*n2)  (6.2c) 

where  e^n^/tj)  represents  the  quantized  prediction  error  of  the  color  image,  q(n  ltn2) 
represents  the  quantization  error,  x(n  x,n^)  is  the  predictive  coding  approximation  of  the 
original  color  input  signal,  and  x(nlrn 2)  is  the  predicted  value  of  x(n1,n2).  This  notation 
is  different  from  that  used  in  Chapters  II  to  IV  where  xnot  x  represented  the  predicted 
value.  We  use  the  present  notation  to  be  more  consistent  with  what  is  conventional  in  the 
literature  for  image  coding.  Note  that  here  x  is  an  approximation  to  x  and  x  is  the  linear 
predicted  value  based  on  x. 

A  predictive  coding  system,  which  is  easier  to  construct  than  the  predictive  coding 
system  just  described,  is  shown  in  Fig.  6.3.  Gibson  (1980)  called  it  the  prediction  error 
coder  to  differentiate  it  from  the  previous  system.  The  disadvantage  of  the  prediction 
error  coder  is  the  accumulation  of  quantization  noise  at  the  receiver.  The  reason  for  that 
can  be  explained  as  follows.  The  same  linear  predictor  is  used  in  the  transmitter  and 
receiver  but  the  input  signal  to  each  linear  predictor  is  not  the  same.  In  the  transmitter 
the  input  signal  is  the  actual  signal  to  be  transmitted  while  in  the  receiver  the  input  signal 
to  the  linear  predictor  is  the  quantized  version.  Because  of  this  difference  and  the  positive 
feedback  in  the  receiver  (as  shown  in  Fig.  6.3),  quantization  noise  tends  to  accumulate. 
In  the  usual  DPCM  system  shown  in  Fig.  6.2,  both  transmitter  and  receiver  use  the  same 
quantized  values  as  input  to  the  linear  predictor.  Thus  quantization  noise  does  not 
accumulate. 

1.  Linear  Prediction  of  Color  Images 

The  most  important  part  of  the  prediction  coding  system  is  the  linear  predictor. 
Linear  prediction  for  multichannel  2-D  random  processes  was  developed  and  studied  in 
Chapter  III.  Color  images,  as  we  mentioned  before,  can  be  considered  as  3-channel  2-D 
random  processes.  If  x(n  n 2)  are  a  set  of  color  picture  elements,  a  linear  prediction  for 
the  (n !  ,n2)th  color  element  using  previous  elements  can  be  written  as 


x(nhn2)=-  £  A^ljx(n1-i1,n2-i2) 

(iuii)ea.  (6.3) 


rjr*  v 


where  a  is  the  region  of  support  for  the  prediction  filter  and  x'  represents  the  prediction 
based  on  the  true  image  values.  The  filter  coefficients  A,il2  can  be  obtained  by  minimiz¬ 
ing  the  mean-square  prediction  error  and  solving  a  set  of  Normal  equations  in  the  form  of 

R  A  =  S  (6.4) 

as  described  in  detail  in  Chapter  m.  In  the  above  analysis  we  neglect  the  effect  of  quanti¬ 
zation  in  the  predictive  coding  system  at  the  transmitter.  Since  the  quantization  error  is 


small,  so  that 


x(/ilt/i2)~  *(«!•«  2) 


the  resulting  filter  coefficients  are  still  approximately  optimal.  In  the  usual  DPCM 
encoder  of  Fig.  6.2  the  linear  prediction  of  the  present  sample  x(n\,n£  is  done  by  using 
the  past  encoded  estimated  sample  values. 


x(/ilf/i2)=-  X  am2x("  \~lun  2-i  2) 

(iuii)ea  (6-6> 

(i^iMO.O) 

Then  the  prediction  error  of  the  color  image 

e(n  1^2) =  x(n  i’n2)  ~  *(n  i’n  2)  (6-7) 

is  quantized  and  transmitted. 

In  a  practical  situation  the  intensity  image  possesses  a  nonzero  mean  value 
which  is  referred  to  as  a  bias.  Maragos  et  al  (1984)  state  three  different  ways  to  handle 
this  bias.  The  first  way  is  by  estimating  the  bias  and  representing  it  in  the  difference 
equation.  This  method  is  called  true  bias  linear  prediction.  A  second  method  is  to  esti¬ 
mate  the  local  mean  of  the  signal,  subtract  it  from  the  original  signal,  then  use  the  ordi¬ 
nary  Normal  equations  to  get  the  filter  coefficients.  This  approach  is  called  local  mean 
linear  prediction.  Finally  one  can  decide  not  to  subtract  any  estimate  of  the  bias  and  sim¬ 
ply  solve  the  Normal  equations.  We  refer  to  this  technique  as  straight  linear  prediction. 
We  use  the  second  method  in  our  experiments. 
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2.  Image  Quantization 


The  signal  values  to  be  transmitted,  (i.e.  the  prediction  error  values),  are  origi¬ 
nally  developed  as  continuous-amplitude  analog  quantities.  To  deal  with  these  analog 
quantities  through  a  digital  system  we  have  to  represent  each  signal  value  by  an  integer 
number  proportional  to  the  amplitude  of  the  analog  signal.  Hence  the  analog  signal  is 
sampled  and  a  discrete  representation  of  the  samples  is  produced.  This  conversion  pro¬ 
cess  is  called  quantization.  As  mentioned  earlier,  there  are  two  types  of  quantization, 
namely  scalar  quantization  and  vector  quantization.  Our  concern  here  with  scalar  quanti¬ 
zation.  Vector  quantization  is  addressed  briefly  in  Appendix  G. 

For  the  development  of  the  scalar  quantization  technique,  let  x  represent  the 
amplitude  of  the  scalar  signal  sample  which  may  represent  a  value  for  the  prediction 
error  (pixel  intensity).  Then  scalar  quantization  can  be  interpreted  as  a  method  of 
representing  x  with  a  finite  number  of  bits  corresponding  to  one  of  the  J  given  levels  of 
the  signal.  The  quantized  value  x  can  be  expressed  as 

x=Q(x)  (6.8) 

where  Q  represents  the  quantization  operation.  The  quantization  problem  is  to  specify  a 
set  of  decision  levels  dj  and  a  set  of  reconstruction  levels  r}  such  that  if  x  falls  between 
dj_i  and  dj ,  it  is  mapped  to  the  reconstruction  level  r; .  If  the  reconstruction  levels  and 
decision  levels  are  uniformly  spaced  we  refer  to  this  method  as  uniform  quantization. 

In  general  the  decision  and  reconstruction  levels  are  chosen  to  minimize  the 
average  distortion  D  defined  by 


D  =E^d( xj) 


where  d  is  a  measure  of  the  local  distortion  at  each  pixel.  For  example  in  the  case  of  a 
mean-square  error  quantization  criteria,  the  error  eg  can  be  expressed  as 


and  d  is  defined  as 


en  =x  -x 


=  ej  =  [x-x 


(6.10) 


(6.11) 


The  average  distortion  can  then  be  written  in  the  form  (Pratt,  1978) 


where  J  represents  the  number  of  quantization  levels  and  p  (x )  is  the  probability  density 
function  of  the  signal  x.  The  optimum  values  of  the  reconstruction  levels  and  the  deci¬ 
sion  levels  can  be  determined  by  minimizing  D  with  respect  to  rk  and  dk.  If  in  the  case 
of  Eqn.  (6.12)  we  require 

-^-=0  (6.13a) 


(6.13b) 


for  all  values  of  k.  Then  we  can  obtain  the  following  results 

^*♦1 

j  x  p(x)dx 
<4 

r*  =  — - 

j  p(x)dx 


(6.14) 


rk  +  rk+ 1 


(6.15) 


We  see  from  the  above  two  equations  that  the  reconstruction  level  rk  is  the  centroid  of 
p(x)  over  the  range  dk<x<dk+l  and  the  decision  level  dk  is  the  mid-point  between  two 
adjacent  reconstruction  levels. 

Solving  the  two  equations  (6.14)  and  (6.15)  is  a  non-linear  problem.  Max 
(1960)  solved  the  problem  for  the  cases  of  uniform,  Gaussian,  Laplacian,  and  Rayleigh 
densities.  The  results  are  put  in  tables  (Pratt, 1978;  Jayant  and  Noll, 1984)  and  we  refer  to 
this  quantizer  as  the  Max  quantizer. 

Fig.  6.4  represents  a  specific  example  of  the  uniform  quantizer  for  two  and  three 
levels.  Fig.  6.4(a)  represents  a  two-level  quantizer  with  step  size  A  while  i  ig.  6.4(b) 
represents  the  three-level  quantizer  with  threshold  levels  d=d(\),d(2)  and  reconstruction 
levels  r  -r  ( 1  ),r  (2),r  (3). 


(a)  Characteristics  of  two-level  quantizer 


(b)  Characteristics  of  three-level  quantizer 
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Nonuniform  quantization  has  been  considered  as  another  type  of  quantization, 
(Jayant,1974).  In  this  case,  as  we  go  further  from  the  center  of  the  quantizer,  the  step 
size  increases.  The  advantage  of  this  quantizer  is  that  a  reduction  in  bit  rate  for  a  given 
quality  of  encoding  can  usually  be  obtained. 

A  more  recent  type  of  scalar  quantizer  is  the  adaptive  quantizer.  Here  the  step 
size  is  modified  according  to  the  status  of  the  quantizer  memory.  In  particular  the  present 
step  size  is  multiplied  by  a  factor  depending  on  previous  knowledge  about  the  step  size 
used  for  the  past  samples.  (Jayant,1974) 

Poor  design  of  the  quantizer  used  in  predictive  coding  can  cause  different  types 
of  degradation,  such  as  granular  noise,  edge  busyness,  and  slope  overload.  These  effects 
are  illustrated  in  Fig.  6.5  (Netravali  et  al,1980).  Granular  noise  is  caused  by  the  coarse¬ 
ness  of  the  minimum  quantizer  step  in  the  coding  of  a  signal  which  has  a  very  slow 
change  in  the  shape  of  its  waveform.  Slope  overload  distortion  appears  when  the  max¬ 
imum  value  of  the  quantizer  step  is  not  enough  to  follow  a  suddenly  and  rapidly  chang¬ 
ing  input.  Edge  busyness  is  a  distortion  specific  to  image  coding  that  occurs  when  there 
is  a  gradual  change  in  the  contrast  of  the  edges.  In  this  case  the  output  of  the  quantizer 
oscillates  between  two  levels  around  the  actual  value  of  the  signal.  This  oscillation  may 
cause  a  change  from  one  line  to  the  other  which  gives  the  appearance  of  a  "busy  edge". 
(Netravali  and  Limb,  1980) 

The  characteristics  and  the  requirements  on  a  given  quantizer  depend  on  the 
type  of  application.  As  an  example,  the  requirements  on  the  quantizer  in  a  moving  area 
of  an  image  is  completely  different  from  the  requirements  for  the  background  areas.  In 
case  of  background  a  low  granular  noise  effect  is  required  and  this  can  be  achieved  by 
designing  a  fine  quantizer  (small  step  size).  In  the  case  of  moving  areas,  slope  overload 
distortion  must  be  avoided.  To  satisfy  this  we  have  to  choose  a  coarse  quantizer  with 
high  dynamic  range. 

In  general  a  tradeoff  between  coarse  and  fine  quantizers  must  be  determined.  A 
fine  quantizer  will  generate  the  slope  overload  distortion  which  can  be  seen  as  an  unclear 
sharp  edge;  while  a  coarse  quantizer  may  produce  the  granular  noise  and  edge  busyness 
discussed  above. 
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Fig.  6.5  Illustration  of  waveform  coding  by  predictive  coding 
and  a  different  types  of  reconstruction  error 
(Netravali  and  Limb. 1980). 
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Various  measures  have  been  developed  to  measure  the  overall  performance  of  a 
predictive  coder.  The  most  widely  used  measures  are  the  normalized  mean  square  error 
(NMSE)  and  the  signal-to-noise  ratio  (SNR).  These  can  be  defined  in  case  of  multichan¬ 
nel  2-D  random  processes  as  follows: 


(6.16) 


NMSE  =  Cover  ^(n^/ij)  -  x(fli,/»2)j  |c<?var  ^x(/ilt/i 2)j 
SNR  (dB )  =  10  log  [NMSE  j  -1 


(6.17) 


where  Cover  [ .  ]  represents  the  covariance  matrix  for  the  quantity.  Note  that  the  NMSE 
and  the  SNR  are  in  the  form  of  matrices  with  dimension  MxM.  It  is  known  that  the  SNR 
is  not  a  perfect  measurement  of  image  quality.  There  are  cases,  as  we  will  see  in  the  next 
section,  where  the  coder  that  has  the  highest  value  of  the  SNR  generates  a  reconstructed 
signal  which  is  subjectively  lower  in  quality  than  one  generated  by  a  lower  SNR  coder 
(Gibson,  1980).  The  reason  for  that  lies  in  the  dependence  of  the  quality  on  the  amplitude 
of  the  quantization  noise  and  not  on  its  power  (see  Eqn.  (6.16)).  However  the  SNR 
defined  by  Eqn.  (6.17)  is  still  the  measure  most  widely  used  because  of  its  computational 
simplicity  and  ease  of  interpretation. 

3.  Side  Information 

In  addition  to  quantizing  and  transmitting  the  residual  of  the  color  image,  we 
have  to  also  quantize  and  transmit  the  predictor  filter  coefficients.  We  refer  to  this  as 
side  information .  The  size  of  the  predictor  coefficient  matrix  of  the  color  image  depends 
on  the  model  order  used.  As  an  example  a  first  order  NSHP  model  has  12  (matrix) 
parameters,  while  an  eighth  order  NSHP  model  has  381  such  parameters.  These  numbers 
of  parameters  are  very  large  compared  to  the  corresponding  number  of  parameters  for  the 
single  channel  2-D  case. 

The  size  of  the  predictive  filter  is  of  concern  not  only  because  of  the  amount  of 
side  information  that  needs  to  be  transmitted  but  also  because  of  the  real  time  computa¬ 
tion  requirement.  For  predictive  coding  systems  a  set  of  Normal  equations  needs  to  be 
solved  to  get  the  predictor  parameters.  This  operation  uses  the  image  data  itself  and  is 
done  in  real  time.  This  significantly  slows  the  predictive  coding  process  and  increases 
the  complexity  of  the  coding  system. 


In  this  research  we  considered  a  possible  way  to  overcome  the  disadvantage  of 
the  real  time  computation  and  transmission  of  side  information.  The  idea  is  to  select  a 
fixed  set  of  predictor  parameters  and  apply  it  directly  to  the  data.  In  this  case  no  real 
time  computations  are  required,  and  there  is  no  need  to  transmit  the  side  information. 

The  criteria  for  selecting  the  fixed  predictor  parameters  will  be  explained  in  detail  in  the 
next  section. 

C.  CODING  EXPERIMENTS  WITH  COLOR  IMAGES 

In  this  work  a  color  image  is  represented  by  its  red,  green,  and  blue  (RGB)  com¬ 
ponents.  An  alternative  representation  that  may  have  some  advantages  is  in  terms  of 
luminance-chrominance  (YIQ)  components.  However  this  is  not  explored  here.  • 

The  original  pictures  used  in  this  thesis  are  256x256  pixels  in  size  but  in  order  to  stay 
within  our  computational  budget  we  selected  portions  of  the  original  pictures  with  sizes 
of  128x128  pixels  as  shown  in  Fig.  6.6.  A  variety  of  pictures  used  in  the  experiments  are 
shown  in  the  figure. 

The  linear  predictive  models  developed  in  this  thesis  were  applied  to  the  coding  of 
these  color  images.  Two  different  procedures  were  used  initially.  In  the  first  procedure 
the  total  frame  of  the  image  is  divided  into  four  subframes  with  64x64  pixels  each.  The 
predictor  coefficients  and  the  error  covariance  matrix  are  obtained  separately  for  each 
subframe  by  solving  a  set  of  Normal  equations.  The  different  linear  predictive  filters 
derived  are  then  applied  to  the  different  subframes.  The  residual  error  covariance  matrix 
and  the  set  of  filter  coefficients  for  each  subframe  are  coded  and  transmitted.  Since  there 
is  a  specific  filter  for  each  subframe,  a  discontinuity  appears  at  the  subframe  edges.  To 
overcome  this  effect  we  overlap  the  subframes  by  four  pixels  and  average  the  results  in 
the  overlapped  regions. 

In  the  second  procedure,  the  whole  frame  of  the  image  is  taken,  and  the  Normal 
equations  are  solved  to  get  only  one  set  of  predictor  matrices.  This  single  error  covari¬ 
ance  matrix  and  the  set  of  filter  coefficients  for  the  whole  frame  are  coded  and  transmit¬ 
ted  along  with  the  residuals  as  in  the  first  procedure. 

Note  that  in  both  of  these  procedures  the  predictor  matrices  must  be  calculated  in 
real  time.  In  the  first  case  the  real  time  calculation  is  done  for  each  subframe  and  in  the 
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second  case  it  is  done  for  the  whole  frame.  These  real  time  computations  and  the  need  to 
send  results  as  side  information,  greatly  increase  the  complexity  of  the  coding  system. 

As  an  alternative  we  considered  two  new  procedures  which  overcome  the  disadvan¬ 
tages  of  the  previous  ones.  In  the  first  of  these  new  procedures  a  fixed  set  of  predictor 
matrices  is  used.  We  refer  to  this  in  the  experiments  as  the  third  procedure.  The  fixed 
set  of  predictor  matrices  is  chosen  on  the  basis  of  the  following  criteria.  The  filter 
parameters  for  multichannel  2-D  linear  prediction  are  determined  for  each  of  the  dif¬ 
ferent  images  separately.  We  use  all  the  original  pictures  shown  in  Fig.  6.6  except  the 
jelly  bean  picture  of  Fig.  6.6c.  A  new  fixed  set  of  parameters  is  obtained  by  averaging 
the  parameters  corresponding  to  the  different  images.  Then  this  new  set  is  applied  in  our 
process. 

In  the  second  new  procedure  a  fixed  autocorrelation  matrix  is  generated  by  averaging 
a  group  of  autocorrelation  matrices  estimated  from  the  same  data,  i.e.  all  images  except 
the  jelly  beans.  The  predictor  parameters  are  developed  by  solving  a  set  of  Normal  equa¬ 
tions  involving  the  fixed  average  autocorrelation  matrix.  As  in  the  previous  case  the  resi¬ 
dual  is  quantized  and  sent  to  the  receiver  and  there  is  no  need  to  send  any  side  informa¬ 
tion.  We  refer  to  this  in  the  experiments  as  the  fourth  procedure. 

For  purposes  of  the  coding  experiments  two  images  are  used.  The  first  is  the  lady’s 
face  of  Fig.  6.6(a)  which  is  one  of  the  images  used  to  derive  the  fixed  set  of  prediction 
parameters.  The  second  is  the  jelly  bean  picture  of  Fig.  6.6(c)  which  was  not  included  in 
the  set  of  images  used  to  derive  the  fixed  parameters.  For  each  of  these  images  the  resi¬ 
dual  is  quantized  and  transmitted.  No  side  information  needs  to  be  transmitted  and 
this  procedure  requires  no  real  time  computation. 

Different  experiments  are  given  to  show  the  performance  of  the  third  and  fourth 
(new)  procedures  by  comparing  their  results  with  the  results  of  the  first  and  second  (ori¬ 
ginal)  procedures.  Although  our  experiments  here  used  scalar  quantization,  the  ideas  can 
also  be  applied  to  the  case  of  vector  quantization  (see  Appendix  G).  For  the  scalar  quan¬ 
tizer  two  and  three  levels  of  quantization  are  used.  Linear  prediction  is  performed  using 
second  and  third  order  models.  The  SNR  measurement  defined  by  Eqn.  (6.17)  is  used  as 
a  measure  of  the  performance  of  the  different  predictive  coders  used  in  this  section. 


In  this  experiment  we  used  a  uniform  two-level  quantizer  with  step  size  A 
defined  by 

A  =  ya2  (6.18) 

where  y  is  a  factor  controlling  the  dynamic  range  of  the  quantizer  (this  also  has  an  effect 
on  the  granular  noise  and  the  sharpness  of  the  edges)  and  o2  represents  the  prediction 
error  variance.  The  input  output  characteristic  of  this  uniform  2-level  quantizer  is  given 
in  Fig.  6.4a.  The  behaviour  of  the  quantizer  is  varied  by  varying  the  step  size  A.  A 
second  order  linear  predictive  filter  and  a  2-level  quantizer  with  different  values  of  step 
sizes  is  used  in  this  experiment. 

Four  different  step  size  quantization  levels  are  considered  here 
(y=  1.5,  0.7979,  0.7071,  and  0.5)  starting  with  a  2-level  quantizer  with  large  step  size 
(y=  1.5).  Fig.  6.7  shows  the  results  of  the  four  procedures  where  (a)  is  the  result  of  using 
the  first  procedure  (dividing  total  frame  into  four  subframes  i.e.  L=4),  (b)  is  the  result  of 
using  the  second  procedure  (using  the  whole  frame  i.e.  L=l),  (c)  shows  the  simulation 
results  using  the  third  procedure  (applying  the  averaged  set  of  filter  coefficients),  and  (d) 
is  the  result  of  the  fourth  procedure  (averaging  the  autocorrelation  function). 

When  the  reconstructed  coded  pictures  are  compared  with  the  original  picture 
shown  in  Fig.  6.6a  it  seems  that  sharp  edges  are  very  clear  for  all  the  procedures  but 
granular  noise  is  apparent  in  the  reconstructed  pictures.  Note  that  the  alternative  pro¬ 
cedures  (three  and  four)  produce  good  coded  results  which  are  quite  similar  to  the  results 
of  the  first  and  second  procedures. 

Error  pictures  of  the  four  procedures  are  shown  in  Fig.  6.8.  These  are  generated 
by  taking  the  difference  between  the  original  picture  and  the  reconstructed  one.  Fig.  6.9 
shows  the  error  picture  for  the  same  procedures  but  after  quantization.  Most  of  the 
essential  information  is  retained  in  the  quantized  error  picture. 

Table  6.1  shows  the  SNR  values  obtained  by  applying  the  four  different  pro¬ 
cedures,  for  the  red,  green,  and  blue  components  of  the  color  picture.  The  SNR  of  the 
second  procedure  is  given  as  the  average  of  the  values  obtained  for  the  four  different  sub- 
frames.  It  is  clear  from  the  table  that  the  SNR  values  for  the  various  procedures  are  close 


Fig.  6.7  Coded  images  (7=1.5). 

(a)  First  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 
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Fig.  C.8  Error  images  ( •*)  =  1.5). 

(a)  First  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 
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Fig.  G.9  Quantized  error  images  (0  =  1.5). 

(a)  First  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 
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to  each  other.  The  highest  SNR  values  are  obtained  for  the  first  procedure  but  there  is 
only  about  a  0.5  dB  improvement 

As  a  second  simulation  for  this  example,  the  step  size  is  given  a  smaller  value 
(Y  =  0.7979).  This  value  is  taken  from  the  Max  table  for  the  Gaussian  distribution.  Fig. 
6.10  shows  the  corresponding  results  using  the  same  four  procedures.  Good  results  are 
obtained  for  non-edge  regions  and  good  sharp  edges  are  also  obtained.  The  SNR 
results,  which  measure  the  performance  of  the  reconstructed  pictures,  are  given  in 
Table  6.2.  The  first  procedure  gave  an  improvement  of  0.5  dB  while  the  third  and 
fourth  procedures  lost  0.5  dB  with  respect  to  the  second  procedure. 

Fig.  6.1 1  and  Table  6.3  show  the  results  of  choosing  y  corresponding  to  a  Lapla- 
cian  distribution  (y  =  0.7071).  Qualitatively  we  get  the  same  results  as  in  the  recon¬ 
structed  pictures  as  for  Gaussian  distribution.  However  there  is  a  1  dB  improvement  in 
the  SNR. 

Finally  the  results  of  the  last  simulation  for  this  experiment  are  given  in  Fig. 
6.12  and  Table  6.4.  In  this  case  a  smallta  step  size  is  used  (y  =  0.5).  In  this  case  the 
predictive  coding  system  provides  good  results  in  the  non-edge  regions  but  the  edges  are 
blurred  out  as  we  expect.  At  the  same  time  there  is  an  improvement  of  approximately  3 
dB  in  the  SNR  over  the  Laplacian  case. 

To  complete  this  experiment  and  to  check  the  performance  of  the  new  pro¬ 
cedures  on  a  picture  that  was  not  in  the  set  used  to  derive  average  parameters,  the  jelly 
bean  picture  (Fig.  6.6c)  was  tested.  For  this  case  a  two-level  quantizer  with  y=  0.7071 
was  used.  Fig.  6.13  shows  the  quality  of  the  picture  obtained  by  applying  the  four  dif¬ 
ferent  procedures.  It  is  clear  that  the  quality  of  the  reconstructed  picture  for  procedures 
three  and  four  is  similar  to  that  for  the  first  and  second  procedures.  Table  6.5  shows  the 
SNR  results.  The  highest  SNR  appeared  for  the  picture  coded  by  applying  the  first  pro¬ 
cedure.  The  second,  third,  and  fourth  procedures  yielded  SNR  results  that  are  very  close. 

The  foregoing  results  show  that  using  a  l  sed  set  of  prediction  parameters  yields 
good  image  reconstructions  which  are  close  to  the  results  of  using  real-time  computed 
filter  parameters.  Also  the  variation  of  the  SNR  from  one  procedure  to  the  other  is  small. 
The  new  procedures  Can  be  applied  to  anv  image  outside  the  training  set. 


G.10  Code  d  images  (")  =  0.7979). 

(a)  First  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Four'll  procedure 


TABLE  6.1  SNR  FOR  CODED  IMAGES  USING  2-LEVEL 
QUANTIZER  WITH  y=  1.5 


Procedure 

Red 

Green 

Blue 

First  Procedure 

16.37 

13.44 

11.28 

Second  Procedure 

15.78 

13.40 

10.61 

Third  Procedure 

15.39 

13.20 

9.86 

Fourth  Procedure 

15.38 

13.20 

9.86 

TABLE  6.2  SNR  FOR  CODED  IMAGES  USING  2-LEVEL 
QUANTIZER  WITH  Y=  0.7979 


Procedure 

Red 

Green 

Blue 

First  Procedure 

21.85 

18.93 

16.77 

Second  Procedure 

21.26 

18.88 

16.09 

Third  Procedure 

20.87 

18.68 

15.35 

Fourth  Procedure 

20.87 

18.68 

15.35 

TABLE  6.3  SNR  FOR  CODED  IMAGES  USING  2-LEVEL 
QUANTIZER  WITH  y=  0.7071 


Procedure 

Red 

Green 

Blue 

First  Procedure 

22.90 

19.98 

17.82 

Second  Procedure 

22.30 

19.93 

17.14 

Third  Procedure 

21.92 

19.73 

16.45 

Fourth  Procedure 

21.90 

19.73 

16.44 

TABLE  6.4  SNR  FOR  CODED  IMAGES  USING  2-LEVEL 
QUANTIZER  WITH  y  =  0.5 


Procedure 

Red 

Green 

Blue 

First  Procedure 

25.91 

22.99 

20.83 

Second  Procedure 

25.32 

22.94 

20.15 

Third  Procedure 

24.93 

22.74 

19.41 

Fourth  Procedure 

24.93 

22.74 

19.41 

TABLE  6.5  SNR  FOR  BEAN  CODED  PICTURE  USING  2-LEVEL 
QUANTIZER  WITH  y  =  0.707 1 


Procedure 

Red 

Green 

Blue 

First  Procedure 

22.87 

21.98 

20.00 

Second  Procedure 

22.48 

21.90 

19.86 

Third  Procedure 

21.35 

21.76 

19.05 

Fourth  Procedure 


21.35 


21.76 


19.05 


In  addition  a  2-level  quantizer  is  sufficient  for  these  pictures.  The  bit  rate  in  this  case  is 
reduced  from  8  bits  to  1  bit  per  pixel  for  each  color  component. 

The  step  size  of  the  quantizer  is  very  important  as  we  mentioned  before.  A 
small  step  size  gave  good  non-edge  results  with  poor  edge  sharpness.  A  large  step  size 
provided  sharp  edges  but  granular  noise  in  the  non-edge  areas.  Practically  the  step  size 
should  depend  on  the  application.  Small  step  sizes  can  be  used  when  sharp  edges  are  not 
important.  For  sharp  edge  requirements  a  large  step  size  should  be  used. 

2.  Predictive  Coding  with  a  Three-Level  Quantizer 

The  dynamic  range  of  the  residual  signal  may  sometimes  be  too  large  to  be  well 
handled  by  a  2-level  quantizer.  This  can  lead  to  excessively  large  distortion  in  the  recon¬ 
structed  pictures.  Also  in  the  case  of  a  two-level  quantizer  it  is  difficult  to  control  the 
granular  noise  and  at  the  same  time  keep  sharp  detail  in  the  edges.  Three-level  quantiz¬ 
ers  attempt  to  overcome  this  problem  by  providing  a  zero  level  for  small  amplitudes  of 
the  residual  signal  and  two  other  side  levels  for  large  amplitudes. 

The  performance  of  the  color  image  coding  algorithms  described  earlier  has 
been  studied  using  a  three-level  quantizer  with  input-output  characteristics  given  in  Fig. 
6.4b.  In  this  figure,  let  r(l)=A,  r  (3)=-A,  d(l)=-d,  and  d(2)=d.  In  this  case  the  quan¬ 
tized  residual  signal  can  be  written  in  the  form 

A 

e<J(nl,n^)=-  0  -d  <  e(n j,n2)  <  d  (6.19) 

-A  e(nltnj)^-d 

where  the  reconstruction  and  decision  levels  control  the  dynamic  range  of  the  residual 
signal. 

Two  different  examples  are  considered  here.  In  the  first  example  we  assume 

that  A  is  constant  and  d  =  Fig.  6.14  shows  the  results  of  the  simulated  reconstructed 

images  using  a  linear  predictive  model  of  the  second  order  and  a  uniform  fixed  three- 
level  quantizer  with  A=6  and  d=  3.  As  in  the  previous  section  the  four  different  pro¬ 
cedures  for  determining  linear  prediction  parameters  are  applied  in  this  example.  The 
SNR  results  are  shown  in  Table  6.6.  In  the  second  example  we  assume  a  uniform  three- 


Fig.  C.l 4  Coded  images  (_^  =  6.  d  =  3). 

(a)  First  procedure  (l>)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 


Coded  images  using  3-level  quantizer  and  second  order 
linear  predictive  model  (")  =  3.  p  =  1.3). 

(a)  Fir>t  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 


Fig.  6. 1C  Coded  images  using  3-level  quantizer  and  third  order 
linear  predictive  model  =  2.  p  =  1.3). 

(a)  First  procedure  (b)  Second  procedure 
(c)  Third  procedure  (d)  Fourth  procedure 
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TABLE  6.6  SNR  FOR  CODED  IMAGES  USING  3-LEVEL 
QUANTIZER  WITH  A=6,rf=3 


Procedure 

Red 

Green 

Blue 

First  Procedure 

22.97 

22.23 

18.32 

Second  Procedure 

22.64 

21.16 

17.92 

Third  Procedure 

22.54 

21.25 

17.74 

Fourth  Procedure 

22.50 

21.15 

17.68 

TABLE  6.7  SNR  FOR  CODED  IMAGES  USING  3-LEVEL  QUANTIZER 
AND  SECOND  ORDER  PREDICTIVE  MODEL  WITH  Y=2,p=1.3 


Procedure 

Red 

Green 

Blue 

First  Procedure 

19.73 

18.07 

16.02 

Second  Procedure 

19.72 

17.16 

15.40 

Third  Procedure 

19.45 

17.43 

15.02 

Fourth  Procedure 

19.38 

17.35 

14.64 

TABLE  6.8  SNR  FOR  CODED  IMAGE  USING  3-LEVEL  QUANTIZER  AND 
THIRD  ORDER  PREDICTIVE  MODEL  WITH  ?=2,p=1.3 


vn.  CONCLUSIONS 


This  thesis  described  model-based  methods  for  spectral  estimation  of  multiple  2-D 
signals.  Multichannel  2-D  power  spectral  estimation  based  on  AR  modeling  was  studied 
extensively.  We  presented  the  multichannel  models  and  gave  an  extension  of  the 
Jackson-Chien  procedure  for  combining  two  quadrant- based  estimates  into  a  single  com¬ 
bined  estimate  for  the  spectral  matrix.  The  latter  was  called  the  combined  quadrant(CQ) 
method.  We  further  presented  a  method  for  estimation  of  the  model  parameters  directly 
from  the  data.  We  also  proposed  a  measure  for  comparing  the  closeness  of  two  spectral 
estimates  based  on  the  divergence.  Forward  and  backward  NSHP  models  were  shown  to 
produce  different  estimates  for  the  spectral  matrix  but  the  estimates  were  observed  to  get 
close  to  each  other  as  the  order  of  the  model  increased.  Both  the  maximum  likelihood 
method  (MLM)  of  spectral  estimation  and  the  Improved  MLM  were  extended  to  estimate 
the  spectral  matrix  of  auto-  and  cross-spectra  for  multiple  2-D  random  processes.  Finally 
the  multichannel  2-D  spectral  analysis  models  were  applied  to  the  problem  of 
image  coding. 

Several  examples  were  given  of  computed  spectra  and  extensive  experimental  stu¬ 
dies  of  performance  were  carried  out  Phase  estimates  of  the  cross  spectra  for  all  of  the 
methods  were  consistently  accurate  and  were  relatively  insensitive  to  noise.  Peak  loca¬ 
tion  for  narrowband  cases  of  sinusoids  in  noise  was  however  noise  sensitive  and  showed 
more  bias  with  increased  noise  level. 

In  comparing  the  methods  presented,  the  CQ  spectral  estimate  compared  favorably 
with  the  NSHP  estimates  and  typically  required  less  computation.  Comparisons  between 
the  NSHP  and  CQ  methods  were  made  both  qualitatively  and  quantitatively  using  a 
variety  of  test  cases.  The  multichannel  MLM  was  found  experimentally  to  give  better 
results  than  those  obtained  when  the  auto-  and  cross-spectra  are  estimated  separately  by 
MLM.  Experimental  results  show  that  MLM  has  generally  poorer  resolution  than  AR 
based  methods.  However  the  IMLM  appears  to  have  almost  comparable  resolution  to  the 
AR.  Finally,  the  application  of  spectral  analysis  models  to  color  image  coding  shows 
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that  we  can  overcome  the  real  time  computational  problem  involved  in  computing  the 
predictive  coefficients  by  using  fixed  coefficient  matrices  derived  from  a  representative 
set  of  training  images.  This  reduces  complexity  of  the  coding  and  eliminates  the  need  to 
transmit  side  information. 

This  thesis  can  be  considered  as  a  first  attempt  at  spectral  analysis  of  multichannel 
two-dimensional  random  processes.  More  experience  needs  to  be  gathered  in  application 
of  these  methods  to  practical  problems  involving  measured  data.  Further,  other  methods 
of  spectral  analysis  such  as  the  signal  and  noise  subspace  methods  were  not  studied  in 
this  work  and  could  be  explored.  In  addition  further  application  of  multichannel  models 
to  image  coding  should  be  looked  at.  The  latter  topic  was  dealt  with  only  as  a  brief 
excursion  here  but  the  results  obtained  are  quite  encouraging. 
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APPENDIX  (A) 

2— D  BURG  ALGORITHM 


As  mentioned  in  Chapter  n,  Burg  developed  an  algorithm  for  estimating  1-D  linear 
prediction  parameters  directly  from  the  data.  No  estimation  of  the  autocorrelation  func¬ 
tion  is  required  in  this  algorithm.  Given  the  2-D  reflection  coefficient  representation 
developed  by  Marzetta,  the  1-D  Burg  algorithm  can  be  straightforwardly  extended  to  the 
2-D  case  as  follows.  Given  a  wide  sense  stationary  2-D  random  process  with  zero  mean 
x(nl,n2),  the  idea  of  forward  linear  prediction  is  to  estimate  the  "present"  sample 
from  a  weighted  sum  of  the  "past"  samples. 

jf(«1n2)  =  -  x  /A  1X 

(A.l) 

OViHO.O) 

where  a  represents  the  filter  region  of  support  and  aflij  represent  the  filter  coefficients. 

Similarly,  the  estimated  value  of  the  sample  using  backward  linear  prediction  can  be 
written  as 


x'(n1n2)  =  -  X  <W(«i+'i>"2+‘2) 

(A.z) 

OViMO.O) 

where  we  have  taken  account  of  the  fact  that  the  single  channel  forward  and  backward 
linear  predictive  filters  parameters  are  the  same.  Then  the  forward  and  the  backward 
error  signals  e(n  1,/i2)  and  e'(nltn2)  can  be  expressed  by 


e(*i/»2)  =  E  i^2“* 2) 


(A. 3) 


e'(nltn2)=  £  «,•„•!*(« i+*i.»2+«2) 

(i,.i2)ea 


(A.4) 


where  aoo=l.  These  values  of  the  error  signals  for  order  (p  ltp2)  can  be  written 
recursively  as 


■'«.  'V.  ■'u.  **„  ».  ■_ 


.  h.w  <r  ^  v  « .  k. 


V  •  *  «_  -  *  ^  V 


E^in  x,n2)  =  e^^in  ^  ,-p  ltn2-P2)  (A.5) 


e'^^(nl,n^  =  e^^%l,n^-K(Pl'P2)e(p^l\n1+pl,n2+p^  (A.6) 

where  K^Pl'p^  and  K,(p''p^  represent  the  forward  and  backward  reflection  coefficients 


respectively.  Then  the  corresponding  forward  and  backward  filter  of  order  (p\,p2)  can  be 
recursively  estimated  as 


H(PuP2)(zl,z2)  =  H(PuP^\zl,z2)-ziP'  z2P 1  K(p"p*  ,zil)  (A.7) 


HApuP2)(zlyz2)  =  H'(Pl’PT'l\zl,z2)-zPl1  zf  H^^l){z-X\z2x)  (A.8) 


Defining  the  total  estimated  error  power  P  as  the  average  of  all  possible  computed  for¬ 
ward  and  backward  squared  error  signals  for  the  data  results  in 


=  £-j  Zin^ni)  +  e'(n1,n2) 


(A.9) 


By  substituting  Eqns.  (A.5)  and  (A.6)  into  Eqn.  (A.9),  the  power  can  be  expressed 


p(Pi<Pi)  _  E  \^^%x,n2)-K^^z^^l\nl-pl,n2-p^ 


(A.  10) 


e'^1’^1)(n1,n2)-A:(/7l’/,j)e(/?1'/7z-1)(n1+p1,n2+p2) 


To  find  the  best  least  square  estimate,  the  total  power  is  minimized  with  respect  to  the 
reflection  coefficient  parameter  i.e. 


>  Ol^2> 


dK {pi,P:d 


(A. 1 1) 


The  solution  of  Eqn.  (A.  11)  is  found  to  be 


%  *.  V  .V  *»  %  \  '/% 


'  S*  Ni  C  *  '  «w '  v'  O 


g(PlJ>-L>  _ 


2  2  r  e<r'fri\ni,n2>  t-p  ,,n2-P2) 

(nt^i2)ea'-  -* 

S  +[  £/</>1'Pr’1)(rt  ]~p  1,«2“P2) 


Tl 


(A.  12) 


Then  the  average  power  can  take  the  form  (Marzetta,1978) 


p(Pl^2)  _p(Pl^>2-l)^  _  j 


(A.  13) 


Now,  to  begin  the  computation,  the  order  is  set  to  (0,0)  with  the  initial  con¬ 

ditions: 


ff<0-°W  2)  =  1 

(A.  14a) 

e(0,0)(n !  ,n  2)  =  x  (n  j  ,n  2) 

(A.  14b) 

Z^Xn^nd  =x(nltn2) 

(A.  14c) 

Pmm  =  E  2+^0.0)f (l>t„ J 2 

L  J  L  J 

(A.14d) 

For  /?!=()  and  (\<p2<p,2)  or  f°r  (l^p  i</> i)  and  (— /* 2<P 2<^> 2)»  //^'’^(z^z^)  is  com¬ 
puted  recursively  from  Eqn.  (A.7).  Also  the  forward  and  backward  error  signals  are 
computed  recursively  from  Eqns.  (A.5)  and  (A.6).  Between  adjacent  columns  the  fol¬ 
lowing  relations  are  used  for  the  transition  of  the  recursion: 


H(p"-pr\i'Z2)  =  H(pi-1'p^zl,zd  (A.  15a) 

p(pu-PT\)  =  (A.  15b) 

e(pi'-p2-1)(n1,n2)  =  e(p,-1-P2)(n1,n2)  (A.15c) 

e^-P2~%  1>n2)  =  eApi~l'p^(nl,n2)  (A.15d) 


The  latter  equations  also  appear  in  Marzetta’s  2-D  Levinson  recursion  and  give  a  true 
minimum  phase  filter  when  the  order  parameter  P2  is  infinity. 
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APPENDIX  (B) 

LEVINSON  WIGGINS  ROBINSON  (LWR)  ALGORITHM 


In  this  appendix  the  computational  procedure  of  1-D  multichannel  Levinson  recur¬ 
sion  known  as  the  LWR  algorithm  is  described.  Let  x(n )  be  a  wide  sense  stationary  M- 
channel  1-D  random  process. 

x \(n ) 
x2(n) 


X/nin) 

Then  the  forward  linear  predictive  filter  of  order  P  applied  to  the  data  can  be  expressed 
as 


where 


(B.2) 

(B.3) 

(B.4) 


a(l>  = 


ap-i.o  a p-u  •  •  •  4‘ 

and  xn  is  an  vector  of  the  ordered  values  x(n  -i ) 


aio  <*ii' 


.  .  a,' 


x(n) 
x(n- 1) 


x(n-P) 


The  corresponding  Normal  equations  are 


Ra-E 


’R(O) 

R(-l)  . 

.  .  R(-P+l) 

R(l) 

R(0)  . 

.  .  R(-P  +2) 

R(P-l)  RC/’-Z)  . 

.  .  R(0) 

where 


x(n )  xT  ( n  -k ) 


and  E(/> )  is  the  error  covariance  of  the  optimal  estimate. 


E  {P)  =  e\  (xn-x.)(xn-x„) 


Similarly  the  backward  linear  predictive  filter  can  be  written  in  the  following  form 


e.'(n-P+l)  =  P%  [p01]  T  x(n-P+l+i)  (B.ll) 

where  0^  are  the  backward  multichannel  1-D  filter  coefficient  matrices.  The 
corresponding  Normal  equations  are 

R0  =  E'  (B.12) 

I 

p(i) 

where  E'(P)  represents  the  backward  error  covariance  matrix. 

E'(/>)  =  E^e'Cn )  e'T(n )  (B.14) 

Note  that  since  from  Eqn.  (B.9)  we  have 

R(k)  =  RT(-k )  (B.15) 

and  the  blocks  of  the  correlation  matrix  in  (B.13)  are  the  transposes  of  the  blocks  of  the 
matrix  in  (B.8).  The  solution  of  Equations  (B.8)  and  (B.13)  for  forward  and  backward 
filter  coefficients  can  be  done  recursively  by  relating  the  order  (p)  to  the  order  (p-1)  as 
follows 


R(0)  R(l)  .  .  .  R(P-l) 

R(— 1)  R(0)  .  .  .  R(P  -2) 


R(— P+1)  R(-P  +2)  .  .  .  R(0) 


where  <rp-1)  and  ^  ^  are  the  block  reverses  of  a(p  and  ft?  respectively,  and 
and  K,(p)  represent  the  forward  and  backward  reflection  coefficient  (or  partial  corre¬ 
lation)  matrices  of  order  (p)  respectively.  These  can  also  be  computed  recursively 
as  follows 


where 


K(p)=  A(p) 

(B.17a) 

K/0»)=  1 

(B.17b) 

=  [Rr(l)  Rr(2)  ...  Rr(p)j  aP'V 

(B.18) 

e(p)_e(p-D  j^i_K/(p)K(p)j 

(B.19a) 

E<p)  _  E'(p-D 

(B.  19b) 

Details  of  the  derivation  of  these  results  can  be  found  in  (Strand,  1977). 


APPENDIX  (C) 

MULTICHANNEL  1-D  BURG  ALGORITHM 


A  multichannel  1-D  Burg  algorithm  was  first  developed  by  Nuttall  (1976)  and  Strand 
(1977)  independently.  The  idea  is  to  compute  the  reflection  coefficients  matrices  directly 
from  the  data  instead  of  estimating  the  autocorrelation  function  and  from  these  compute 
the  filter  parameters.  Thus  the  multichannel  1-D  Burg  algorithm  uses  the  LWR  algo¬ 
rithm  (Appendix  B),  except  that  the  reflection  coefficient  matrices  are  computed  from  the 
available  data. 

Assume  we  have  M-channels  with  N  samples  and  define  the  following  terms 


1  Nf  f 

F=-f-  Zc<p>(«)  e^>(n) 

NP  n= 1  L  ' 


55=TT-  Xe,(p)(«)  e,(p)(n) 

NP  n=l  L  J 

jy  j* 

0=-~  f  e^n)  le^n)] 

NP  n= 1  1  J 


(C.la) 


(C.  lb) 


(C.lc) 


where  NP-N-P  (P  represents  the  order  of  the  filter),  and  e(p\n )  and  e,(p  \n )  represents 
the  forward  and  backward  error  signals  respectively.  Then  the  matrix  A(p)  defined  in 
Appendix  B  is  obtained  as  the  solution  of  the  bilinear  equation 


Cj  a<^)  +  a(?)c2  =  c3 


where 


■i  =  ["eKp-uJ 
:2=[e<>-3)] 


(C.3a) 


(C.3b) 


•W 


v  V >  A  ■  »  v  V  v  V  V  V 


and  where  E^_1)  and  E'^-1'  represent  the  error  covariance  matrices  of  the  forward  and 
backward  linear  predictive  filters  respectively. 

There  are  several  approaches  that  can  be  used  to  obtain  explicit  solutions  to  Equation 
(C.2).  One  form  of  the  solution  is  given  by  Pease  (1965)  (Kucera,1974; 
Hartwig,  1972, 1975).  The  basic  idea  of  this  method  is  to  transform  Eqn.  (C.2)  into  an 
equivalent  vector  form 


Q  [c,  A^]  =  [c,  C3 

where  represents  the  column  string  of  A^ 

c,  A^Ja^  •••  A jg\  Ag>  •••  A jSl 
and  cs  C3j  represents  the  column  string  of  C3  and  where 


(C.4) 


m}T 


(C.5) 


Q=C1®/W+/W®C^  (C.6) 

where  IM  is  an  (MxM)  identity  matrix.  Thus  having  solved  Eqn.  (C.4)  for  A^,  the 
reflection  coefficient  matrices  K(/,)  and  K'^  can  be  found  from  (B.17)  and  the  forward 
and  backward  error  can  be  written  recursively  in  the  form 


[k^  Te '(p-%) 

(C.7a) 

r  ■)  T 

e^/i+D 

(C.7b) 

Now,  assuming  x(n )  is  a  an  M-channel  random  process,  the  recursive  algorithm  can  start 
with  the  following  initial  conditions: 

a(0>  =  B(0)  =  lM 


lit. 


e(0)(n  )  =  x(n) 


z*°\n )  =  x(/i ) 


E(0)  _  E'(0)  _  _L  £  e(0)(n )  j"  e'(0)(n 


where  a(0)  and  |$(0)  are  the  forward  and  backward  filter  coefficients  respectively,  of  order 
zero.  Using  Eqn.  (C.l)  the  values  F,  B,  and  G  can  be  computed  which  are  used  for  solv¬ 
ing  Eqn.  (C.4)  to  obtain  A^  \  The  reflection  coefficient  matrices  are  then  computed  from 
(B.17),  and  (B.16)  and  (B.19)  of  the  LWR  algorithm  are  used  for  computing  the  next 
order  filter  coefficients  and  error  covariances. 


•. s  %  \ 


APPENDIX  (D) 

MULTICHANNEL  2-D  BURG  ALGORITHM 


As  we  mentioned  in  Chapter  HI  section  G  a  2-D  multichannel  Burg  algorithm  can  be 
developed  as  a  straightforward  combination  of  the  2-D  Burg  algorithm  (Appendix  A), 
and  the  1-D  multichannel  Burg  algorithm  (Appendix  C).  Assume  that  the  multichannel 
2-D  random  process  x(n1,/i2)  generating  the  M-channel  data  points  is  zero  mean 

and  wide  sense  stationary.  The  present  value  x(/t1,/t2)  can  be  predicted  approximately 
from  a  linear  weighted  sum  of  the  past  values. 

x(/ti«2)=-  X  A^x(n1-i1,n2-i2) 

(MaJea  (D.  la) 

(ii.ijMO.0) 

This  is  called  a  forward  prediction,  because  xin^n^)  is  predicted  ahead,  in  terms  of  "pre¬ 
vious"  values.  Similarly  we  consider  the  backward  prediction 

x'(/»1«2)  =  -  X  Kiix(ni+ii'n2+i2> 

(«i  ,i&a  (D.lb) 


where  the  Bitij  represent  the  backward  filter  coefficients.  (Note  that  the  region  a  is  the 
same  although  the  prediction  matrices  are  not).  Thus  the  forward  and  backward  predic¬ 
tion  errors  can  be  written  as 


e(«i/»2)=  X  Ki2x(n  i-i^n  2-t2> 


e'(n !  n  2)  =  X  i+i  i>n 2+i 2) 


(D.2a) 

(D.2b) 


where  Aqq  =  Bqq  =  /.  The  forward  and  backward  filter  of  the  order  ip  l!p2)  can  be  recur¬ 
sively  estimated  from 


268 


(D.3a) 


H^1^(z1^2)  =  H(p,'rl)(z1^2)-zfpi  zipl  a?) 

H/</’l,/,j)(z1,z2)  =  H/<Pl'Pl_1)(z1^2)-zf  zf  H(Pl<Pl_1)(zr1,z2-1) 

(D.3b) 

where  and  are  the  forward  and  backward  reflection  coefficient  matrices 

of  order  ip  ^2) with  dimension  (MxM).  The  forward  prediction  error  £<Pl'P2’>(n1,n  2)  and 
the  backward  prediction  error  of  order  (p  1^2)  can  be  written  recursively 

in  the  following  forms 

,.*2)  =  „n2)  -  e'<p"pi_,>(«  ,-p  „n  2-p2) 

(D.4a) 

e^,J,l)(n  „n2)  =  e'<'’"'’w)<;.1,*2)  -  K'<P■J”,'  e<p,,!",,(n1+p  ^j+pj) 

(D.4b) 

The  reflection  coefficients  are  estimated  by  straightforward  extension  of  the  pro¬ 
cedures  for  1-D  multichannel  Burg  algorithm  (  see  Appendix  C  Eqns.  (C.2)-(C.5))  and 
will  not  be  detailed  here. 


APPENDIX  (E) 

MULTICHANNEL  2-D  INCREMENTAL 
DIVERGENCE  MEASUREMENT 


The  incremental  divergence  A/ (1,2)  between  two  multichannel  random  processes 
used  in  Chapter  HI  can  be  expressed  in  terms  of  filter  parameters  or  in  terms  of  spectral 
estimates  as  follows. 

1.  In  Terms  of  Filter  Parameters 


AJ  (1,2)  =  -  M 


tr  AjRjAJI^  ^  +  tr  A1R2A1T Lj  * 


(E.1) 


where  M  represents  the  number  of  channels.  A*  (k=l,2)  represents  the  set  of  filter 
coefficients  of  the  process  k,  Rk  is  the  entire  covariance  matrix  of  the  process  k,  and 
finally  X*  is  the  error  covariance  matrix  of  process  k. 


2.  In  Terms  of  Spectral  Estimates 


1 

AJ  (1,2)  =  -  A/  +  — j  j  jtr  £0^0)2 

87t  -n-K 

(E.2) 

n  n 

+  j  jtr  82(0)!, 03^0)2 
-n-n 

where  Sj  and  S2  are  the  M  xM  spectral  matrices  of  the  modeled  random  processes. 

Proof 


For  this  proof  it  is  important  to  note  that  we  regard  the  two  linear  predictive 
filters  as  fixed  and  that  ek(n1,n2)  k= 1,2  represents  the  output  of  the  corresponding  filter 
regardless  of  the  input.  Therefore  when  the  input  to  the  first  linear  predictive  filter  is  the 
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process  to  which  it  is  matched  the  output  1  j'1  2)  wiE  be  the  minimum  mean- square 
prediction  error  process.  Where  the  input  is  the  other  random  process  will  still  call  the 
output  £i(nx,n 2)  but  in  this  case  it  will  not  be  a  minimum  mean- square  prediction  error 
process. 

1.  In  Terms  of  Filter  Parameters 

When  the  input  to  a  linear  predictive  filter  matched  to  a  random  process  k  is 
x(n  x,n2)  the  output  ek(n1,n2 )  is  given  by 


£k(n1,n2)  =  x(nl,n2)-x(n1,n2)  (E.3) 

where 


x(rti«2)=-  X  Ak  (*  vi  2?  x(n  x-i  x,n  2-i  £ 

OVilea  (E.4) 

(ii.ijMO.O) 


and  where  a  represents  the  filter  region  of  support 

When  the  input  is  the  random  process  k  to  which  the  filter  is  matched,  the  out¬ 
put  is  the  minimum  mean-square  prediction  error  for  random  process  k.  In  this  case  the 
error  covariance  matrix  of  the  output  can  be  defined  by 


=Ek  eJt(n1,n2)et'(n1,n2) 


(E.5) 


where  Ek  denotes  expectation  under  the  hypothesis  that  the  input  x(n  1,n2)  is  random  pro¬ 
cess  k .  If  the  input  is  Gaussian  then  the  density  function  for  the  prediction  error  process 
is 


fk\zk{nx,n^  = - - - j-exp 

(27t)2  IZ*!2 

The  direct  divergence  or  the  mean  information  /e(I,2)  corresponding  to  the  error  process 
£j  and  £2  is  defined  as 


el  (ni,*2)E*1£*(n1)n2) 


(E.6) 


/£(1,2)  =  eJa12 


(E.7) 


where  Ali2  is  the  logarithm  of  the  likelihood  ratio 


AU  =  log 


/ 1 

r  *■ 

e^/ti.no) 

»  -/ 

/  2 

e2(t*i.«2) 

w  .7 

(E.8) 


Note  that  since  ek(/i1,n2)  is  uncorrelated  with  observations  of  the  random  process  that 
appear  on  the  right  side  of  (E.4),  7e(  1,2)  represents  the  change  in  mean  information  that 
occurs  when  a  new  observation  x(n1,/z2)  of  the  original  random  process  is  taken.  If  we 
substitute  (E.6)  and  (E.8)  in  (E.7)  then  after  some  algebraic  manipulation  the  direct 
divergence  can  be  written  in  the  form 


7e(l,2)  =  ^E1 


ej  £2 1  £2  -  e7  £1 1  £i  -  log 


lEjl 


Now  observe  that 


T 

£i  ei 


Thus  Eqn.  (E.9)  takes  the  form 


(E.9) 


(E.10) 


Ul^)  =  ~-  4-log 


IS,  I  1 
— — +  —  El 
I  £2 1  2  1 


ej  E21  62 


If  we  rewrite  Eqn.  (E.3)  as 


(E.l  1) 


Zk(nhn^)  =  \Jx  (E.12) 

where  Ak  is  a  matrix  of  the  filter  coefficients  (as  defined  in  Chapter  HI  section  D)  and  x 
is  a  vector  of  the  corresponding  samples  of  the  random  process,  then  the  final  expression 
for  the  direct  divergence  can  be  obtained  by  substituting  Eqn.  (E.12)  in  the  last  term  of 
Eqn.  (E.l  1) 


'«(  1.2)  =  ~- 4- log 


IS,  I  1 


ILji 


+  2  " 


A2  Rj  Aj  E;  * 


(E.13) 
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R1=E,[**r]  (E.14) 

Similarly  the  mean  information  /£( 2,1)  can  be  expressed  by 

/£(2,1)  =  -£2[au]  (H.  15) 

where  E  2[ .  ]  represents  the  expectation  under  /  2  and  this  leads  to 


M  1  1 

7C(2,1)  =  -  —  log  — —  +  —  tr 

eV '  ’  2  2  S  llji  2 


Ai  R2  Af  Ej 1 


(E.16) 


The  incremental  divergence  as  we  mentioned  in  Chapter  II  can  be  defined  as 


A/  (1,2)  =  /£(l,2)  +  /£(2,1)  (H- 17) 

Using  Eqns.  (E.13),  (E.16)  and  (E.17)  the  incremental  divergence  measurement  in  terms 
of  the  filter  parameters  can  finally  be  expressed  in  the  form  of  (E.l). 


2.  In  Terms  of  Spectral  Estimates 

The  estimated  power  spectrum  for  a  random  process  k  using  an  autoregressive 
model  can  be  expressed  as  (see  Chapter  IV): 


and 


S*(co1>co2) 


1 

4k2 


Hi  kcOi.cuj)  H^'Vci)!,^) 


Hi(co1,co2)=  £  A  f(«1*2)e‘>(“h,i,+a*,,) 


(E.l  8) 


(E.19) 


where  a  is  some  chosen  region  of  support  and  Ak(0,0)  is  the  identity  matrix.  Now  sup¬ 
pose  x(/z1,n2)  represent  either  process  1  or  2  with  zero  mean  and  spectrum  S^lcoj.o);), 
and  let  ei(nl5n2)  be  defined  by 
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Then 


e*(/ii/j2)=  £  A/(i1,i2)x(n1-ii^i2-i2) 
(»l.*2)E0t 


S^K.COi)  =  H^COi.COj)  Sx(£D1,C02)  H^CCDi.COi) 


(E.20) 


(E.21) 


By  applying  Parseval’s  theorem  and  using  Eqn.  (E.17)  and  the  commutative  properties  of 
the  trace  we  can  take  the  expectation  with  respect  to  the  random  process  x  and  write 

E  ej(n  j ,n 2)  XJ1  e* (n } ,n =tr  X* 1  E  ek(n1 ,n 2)  ej[(n hn2) 


=  tr  j  j  X*1  S^fcoj ,0)2) 


=  tr  J  J  X*1 


Jt  71 

= jUL  l2*"1  h  Sx (coj.ou^)  S*  ^coj.co^  dco^ojj 

l  nn  r  1 

=  —Ttr  !!  Sjfco^o^S* Voiced  dco^toj  (E.22) 

4ft  -n-rc  *•  -* 

Now  suppose  that  our  input  random  process  x  is  the  random  process  1.  Then  from  Equa¬ 
tion  (E.22)  we  can  say  that 

r  -1  1  *  *  1 

Ex  zl{nx,n2)  Xf1  = — -  tr  J  J  S^coj.coj)  Sf^CDj.c^)  dco^c^ 

L  J  -n-n  L  J 

=  M  (E.23) 


E\  ef(/i1,n2)  X21  E2</* 2)]  =~Vrr  f  f  I" S1(oa1,<o2)  S^’fcoj, o^)]  dto^co. 


Substituting  (E.23)  and  (E.24)  into  the  expression  for  the  direct  divergence  given  by  Eqn. 
(E.9)  we  have 


i,(\.D  =  -^r  ~  log 


i  *  *  r  _  i 

+  — 2  J  J  ^  I  S1(oo1Ia>2)  S21(to1,co2)  du>\d(ib 

'^2'  8tt  -n-n  L  J 


Similarly  we  can  show  that 


(E.25) 


M  1  I  £7 1 
/e(2,1)  =  -—  --log — - 
E  2  2  6  lljl 


1  *  * 

-n-K 


S2(co1,aj2)  S,  '(co^cut) 


(E.26) 

Substituting  (E.25)  and  (E.26)  into  (E.  17),  we  get  the  final  expression  of  the  incremental 
divergence  given  bv  (E.2). 


APPENDIX  (F) 

APPLICATIONS  TO  IMAGE  SPECTRAL  ANALYSIS 


In  this  appendix  we  developed  an  experiment  to  study  the  spectra  of  data 
corresponding  to  a  color  image  shown  in  Figure  F  la.  The  figure  represents  an  aerial 
photograph  of  a  ground  area,  (data  courtesy  of  Rome  Air  Development  Center,  Griffiss 
AFB,  N.Y.),  with  size  128xl28-pixel.  Figures  F.lb,  c,  and  d  represent  the  red,  green,  and 
blue  components  of  the  color  image. 

Two  portions  of  this  image  of  size  64x64-pixels  are  taken.  The  first  part  corresponds 
to  the  lower-left  comer  which  represents  the  field,  while  the  second  portion  lies  in  the 
upper-right  comer  and  represents  the  trees. 

Fig.  F.2a  shows  a  2-D  plot  of  the  intensity  variation  of  the  green  component  in  the 
field  image.  The  index  n  t  is  taken  as  the  horizontal  direction,  and  index  n2  as  the  verti¬ 
cal  direction.  Figure  F.2b  represents  the  one  dimensional  plot  of  Fig.  F.2a.  Different 
slices  of  the  green  component  of  the  color  image  in  the  horizontal  and  vertical  directions 
are  shown.  It  is  clear  that  the  variation  of  the  intensity  in  the  horizontal  direction  is  very 
high  while  in  the  vertical  direction  the  intensity  variation  is  very  low.  This  implies  that 
the  spectral  energy  is  concentrated  around  (1)2  =  0. 

In  a  similar  manner  Fig.  F  3  shows  a  two-dimensional  plot  of  intensity  variation  of 
the  green  component  of  the  trees  portion  of  the  color  image,  in  the  horizontal  and  vertical 
directions.  Rapid  variations  of  intensity  appear  in  the  n  j-  direction  and  very  slow  varia¬ 
tions  appear  in  the  n2'direction.  Fig-  F-2  and  Fig.  F.3  are  taken  here  only  as  an  example 
showing  the  intensity  variation.  We  could  also  show  the  intensity  variation  of  the  other 
color  image  components,  but  the  plots  are  quite  similar. 

The  color  image  is  considered  here  as  a  3-channel  2-D  random  process,  where  each 
color  component  (red,  green,  and  blue)  represents  a  separate  channel.  In  this  example 
only  red  and  green  components  are  analyzed,  where  the  red  component  is  designated  as 
channel  1  and  the  green  component  as  channel  2. 
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(a)  Color  image 


(b)  Red  component 


(r)  Crccn  component 


(d)  Blue  component 


Fig.  F.l  Color  image  of  tree?,  and  field. 
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(a)  2-D  data 


(b)  1-D  slices 


The  blue  component  for  this  image  was  very  dark  showing  very  low  signal  strength  in 
this  channel.  A  sixth  order  NS  HP  model  was  used  to  estimate  the  spectral  matrix. 

Fig.  F.4  shows  the  components  of  the  estimated  spectral  matrix  for  the  field  portion. 
The  spectrum  of  the  red  component  is  shown  in  Fig.  F.4a,  while  the  spectrum  of  the 
green  component  is  shown  in  Fig.  F.4b.  Figure  F.4c  shows  the  cross  spectrum  amplitude 
and  phase.  The  red,  green  and  magnitude  of  the  cross  spectrum  appear  to  be  very  simi¬ 
lar.  The  power  distribution  is  concentrated  around  CO2  =  0  and  the  middle  and  higher  fre¬ 
quencies  of  the  ©!  direction.  This  distribution  of  power  is  due  to  the  appearance  of  regu¬ 
larly  spaced  lines  in  the  image  corresponding  to  the  rows  of  plants  occurring  in  the  field 
image.  The  estimated  phase  of  the  cross  spectrum  component  varies  between  +10° 
and  -40°. 

Fig.  F.5  shows  the  spectral  estimate  of  the  trees  portion  of  the  color  image.  Again 
we  considered  the  red  component  as  channel  1  and  the  green  component  as  channel  2 
The  spectra  of  red  and  green  components  are  shown  in  Fig.  F  5a  and  b  respectively.  Fig¬ 
ure  F.5c  shows  the  amplitude  and  phase  of  the  cross  spectrum  component.  As  in  the  field 
portion,  the  autospectra  (red  and  green)  components  and  the  magnitude  of  the  cross  spec¬ 
tra  are  similar.  The  power  is  also  distributed  around  zero  frequency  in  u>j  direction,  but 
occur  at  lower  frequencies  in  the  oij  direction  than  in  case  of  the  field  portion.  This 
results  agrees  with  the  intensity  variation  plot  shown  in  Fig.  F.3,  which  indicates  that  the 
intensity  variations  are  not  the  same  in  horizontal  and  vertical  directions.  The  phase  of 
the  cross  spectrum  for  the  trees  image  is  also  between  10°  and  -40°. 
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APPENDIX  (G) 
VECTOR  QUANTIZATION 


In  Chapter  VI  we  discussed  scalar  quantization  and  showed  that  a  two  or  three  level 
uniform  quantizer  is  sufficient  to  achieve  good  quality  in  the  reconstructed  coded  images 
using  multichannel  predictive  coding  techniques. 

Vector  quantization  is  another  approach  to  quantization  which  is  used  for  coding  a 
vector  source.  The  data  required  to  be  coded  are  divided  into  blocks  which  are  then 
sequentially  encoded  block  by  block.  It  is  the  vector  generalization  of  the  usual  PCM 
system.  (Linde  et  al,1980;  Gray  and  Linde,  1982;  Gray, 1984;  Cuperman,1985;  Hang  and 
Woods,1985;  Makhoul,1985;  Goldberg,  1986) 

Let  the  image  be  divided  into  N  blocks  of  pixels  and  let  these  pixels  be  formed  into  a 
k-dimensional  vector  x, .  Then  the  total  image  can  be  represented  by  the  larger  vector 


(G.l) 


where 


In  vector  quantization  the  vector  x,  is  mapped  into  another  k  dimensional  sector  x 
which  is  close  to  it  in  k  dimensional  space  The  sector  x  sshich  is  transmitted  iv  then 
composed  of  the  individual  x,  value  We  call  this  the  quwnrcrJ  salue  of  x 


x  =  =q  (x) 


(G.3) 


The  vectors  x,-  are  drawn  from  a  reproduction  alphabet  (or  codebook)  y. 

A  major  issue  for  vector  quantization  is  codebook  design.  The  VQ  algorithm  most 
frequently  used  is  called  Linde-Buzo-Gray  (LBG)  vector  quantizer  algorithm  (Linde  et 
al,1980)  and  involves  a  clustering  of  the  data  in  k-dimensional  space.  A  codebook  y  with 
length  (i.e.  number  of  quantization  levels)  J  can  be  written  in  the  following  form 


(G.4) 


where 


(G.5) 


In  this  case  the  bit  rate  can  be  defined  b\ 


R  =  log  • ./  Pits  per  \t\  u  i 


—  U's 


I 


Note  that  the  number  of  quantization  levels  is  usually  restricted  to  be  a  binary  number  for 
digital  coding  applications. 

Observe  that  VQ  can  allow  fractional  rates  in  bits  per  sample  while  this  is  not  the 
case  in  scalar  quantization.  The  minimum  bit  rate  of  a  scalar  quantizer  is  1  bit  per  sam¬ 
ple  while  a  k-dimensional  VQ  can  theoretically  achieve  a  bit  rate  of  1/k  bits  per  sample. 

The  distortion  caused  by  replacing  the  original  input  vector  x  by  the  new  vector  x  is 
given  by  the  distortion  measurement  d(x,x).  The  most  widely  used  type  of  distortion 
measurement  is  the  squared-error-distortion  (Gray.  1984;  Makhoul.1985). 


d  (x.x)  =  I  I  x  -  x  i  I ' 


=  X  1 X.  —  X.  1  ■ 


(G.8) 


To  quantitatively  measure  the  performance  of  the  coding  system  we  could  use  the  aver¬ 


age  distortion  £^d(x.x)|  .  In  practice  however,  normalized  average  distortion  is  used 
instead  of  the  actual  value  of  the  average  distortion.  We  refer  to  this  normalized  average 
distortion  as  the  SNR  measurement  and  it  can  be  defined  bv 


SNR  =  10  logi( 


£!  I l x I 


£  d  (x.x ) 


(G.9) 


The  performance  of  the  color  image  predictive  coding  technique  using  VQ  and 
applying  the  multichannel  linear  predictive  models  described  in  Chapter  VI  has  been  stu¬ 
died  through  the  following  simulation  experiment.  The  original  pictures  used  in  this 
experiment  are  shown  in  Fig.  b.b.  The  simulations  were  run  on  128x128  pixel  images 
with  8  bits  per  color  component  (i.e.  24  bus  per  color  pixel). 

We  started  the  simulation  with  the  lady's  face  given  in  Fig.  6.6a.  The  predictive 
filter  coefficients  of  the  second  order  mode!  for  the  whole  image  were  obtained  by  solv¬ 
ing  a  set  of  Normal  equations.  The  corresponding  codebook  of  the  residual  signal  was 
obtained  using  the  LBG  codebook  design  algorithm  with  the  following  parameters.  The 
codebook  size  is  J=64  The  vectors  of  dimension  k  t£=3*£ jxPi)  where  3  represents  the 
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number  of  the  color  components  were  then  obtained  by  vectoring  a  block  of  dimension 
(/*  1x/>2)  of  the  picture.  Specifically  in  this  experiment  we  assumed  that  P  1=/>2=2. 

The  reconstructed  coded  lady’s  face  image  using  this  designed  codebook  is  shown  in 
Fig.  G.la  and  the  corresponding  residual  is  shown  in  Fig.  G.lb.  Note  that  we  are  using 
here  the  prediction  error  coder  system  described  in  Chapter  VI  (see  Fig.  6.3),  which  is 
not  as  good  in  quality  as  the  predictive  coding  system  (Fig.  6.2)  but  it  is  easier  to  con¬ 
struct.  The  bit  rate  in  this  specific  example  is  0.5  bit  per  sample  and  the  corresponding 
SNR  measurement  values  are  20.198  dB  for  the  red  component,  17.506  dB  for  the  green 
component,  and  14.879  dB  for  the  blue  component. 

Now,  several  filter  coefficient  matrices  were  designed  for  different  images  by  solv¬ 
ing  a  set  of  Normal  equations  corresponding  to  each  image.  Specifically  we  used  the  all 
images  given  in  Fig.  6.6  except  Fig.  6.6h.  Applying  the  third  procedure  described  in 
Chapter  VI,  we  obtained  a  fixed  filter  coefficient  matrix.  Using  the  same  codebook 
designed  before,  the  corresponding  reconstructed  image  of  the  lady’s  face  is  shown  in 
Fig.  G.lc.  The  corresponding  SNR  for  the  red  component  is  20.026  dB,  for  the  green 
component  is  17.406  dB,  and  for  the  blue  component  is  14.688  dB. 

Comparing  Fig.  G.la  with  Fig.  G.lc  we  see  that  approximately  the  same  quality  of 
pictures  are  obtained.  In  addition,  the  SNR  values  are  very  close  to  each  other.  Note  that 
the  reconstructed  pictures  are  not  of  the  same  quality  as  in  case  of  scalar  quantization 
because  we  used  here  the  prediction  error  coder  system  which  accumulates  noise  at  the 
receiver.  Also  the  block  size  used  is  very  small  ((2x2)  for  each  color  component)  and  the 
size  of  the  training  data  was  not  very  large. 

To  check  the  efficiency  of  the  third  procedure  using  the  VQ  technique  the  fixed  set 
of  filter  coefficients  and  the  designed  codebook  were  used  to  reconstruct  an  image  that 
was  not  part  of  the  training  data.  (The  house  picture  shown  in  Fig.  6.6h).  The  recon¬ 
structed  house  picture  is  shown  in  Fig.  G.ld  and  the  corresponding  SNR  in  dB  are 
16.730, 18.647,  and  19.604  for  the  red,  green,  and  blue  components,  respectively. 


v  "««  V  V  w  ■. 


-r. 


WWW 


Fig.  G.l  Reconstructed  images  using  VQ. 

(a)  Reconstructed  lady’s  face  image  (using  second  procedure) 

(b)  Error  image 

(c)  Reconstructed  lady’s  face  image  (using  third  procedure) 

(d)  Reconstructed  house  image 
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